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Preface 



I was invited by the Mathematics Department of the University of Nevada, 
Las Vegas to give ten lectures on geometrical probability during the week June 
9-13, 1975. Those invited to attend the lectures were given an extensive set of 
notes to go along with the presentations. This monograph is a version of the 
content of those lectures and the lecture notes. 

Geometrical probability has enjoyed a resurgence of interest in recent years. 
The lectures and subsequently this monograph in no way cover all topics that 
could fall legitimately under such a heading. The material here is on topics I have 
presented in several classes in geometrical probability at Stanford University. 
Several students who attended these classes have written papers and doctoral 
dissertations on the subject and in this way have helped me prepare this exposi- 
tion. In particular I would like to single out David Berengut, Peter Cooke, Stuart 
Dufour and Andrew F. Siegel whose efforts have made possible several sections 
of the monograph. My thanks are offered to my colleague, Persi Diaconis, for 
editing and reviewing the early portions of the manuscript. 

I am very grateful to Dr. Aaron Goldman of the University of Nevada for 
initiating and directing the Conference. Participants at the Conference exhibited 
a lively interest in the subject and I am grateful to them for this stimulation. My 
thanks go also to the Office of Naval Research and the Army Research Office for 
their support over the years that led to the preparation of the notes that were 
made available to the participants at the Conference. 
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CHAPTER 1 

Buffon Needle Problem, Extensions, and 
Estimation of n 



The Buffon needle problem which many of us encountered in our college or 
even high school days has now been with us for two hundred years. One major 
aspect of its appeal is that its solution has been tied to the value of ir which can 
then be estimated by physical simulation of the model as was done by a number 
of investigators in the late 19th and early 20th centuries — and by computer 
simulation today. Shortly we go into some detail on enlarging the experimental 
design of the model. Then we discuss a number of ways in which modern 
statistical procedures can yield estimates of tt from these experimental designs 
with much better precision than the original Buffon procedure employed by 
many for the estimation of jr. In this sense we will be featuring the statistician in 
geometrical probability and this will continue, where possible, for other topics 
we discuss in this exposition. 

It is interesting that in the original development, Buffon (1777) extols 
geometry as a companion tool to the calculus in establishing a science of 
probability and suggests that chance is amenable to the methods of geometry as 
well as those of the calculus. Buffon indicates that the human mind, because of 
prior mathematics, preferred numbers to measures of area but that the invention 
of games revolving around area and ratios of areas could rectify this. To high- 
light this point he investigated a game already in practice in the 18th century 
known as "clean tile". 

In a room tiled or paved with equal tiles, of any shape, a coin is thrown 
upwards; one of the players bets that after its fall the coin will rest cleanly, i.e., 
on one tile only; the second bets that the coin will rest on two tiles, i.e., that it 
will cover one of the cracks which separate them; a third player bets the coin will 
rest over 3, 4, or 6 cracks: it is required to find the chances for each of these 
players. 

Buffon investigates this game for square tiles, tiles shaped as equilateral 
triangles, hexagonal tiles, and diamond shaped tiles; in each case he is interested 
in the ratios of the diameter of the coin to the equal sides of the particular 
shaped tile that provides a fair game for each player. In effect, the thrust of his 
work on geometrical probability is the development of fair games. 

As a special case, Buffon in his own words states, "I assume that in a room, the 
floor of which is merely divided by parallel lines, a stick is thrown upwards and 
one of the players bets the stick will not interesect any of the parallels on the 
floor, whereas on the contrary the other one bets the stick will intersect some 
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one of these lines; it is required to find the chances of the two players. It is 
possible to play this game with a sewing needle or a headless pin." 

He then demonstrates that for a fair game between two players, the ratio of 
the length of the needle, /, to the distance between the parallel lines, d, (d>l) 
must equal tt/4 for this provides the probability of an intersection equal to 5- 
This can be seen easily from the Buffon needle result as we now know it, namely 
the probability of an intersection, p = H/ivd). 

In the Buffon model a needle (line segment) is dropped "at random" on the 
grid of equidistant parallel lines in the plane. The notion that random elements 
are geometric objects such as line segments, lines in the plane, circles, rectangles, 
triangles, etc. requires that a measure be defined for such elements before 
probabilistic assertions can be made. The Bertrand paradox at the turn of the 
century suggested a situation wherein future developments could be stymied 
because of a lack of a natural choice of measure. In that paradox, the probability 
that a random chord in a circle exceeds the side of an inscribed equilateral 
triangle can be shown to be \, 3, or \ for each of three different models by which 
the chord is drawn at random. Integral geometry becomes helpful here in 
establishing appropriate models. 

Randomness models will play an important role throughout. As a second 
example, the question of whether pairs of chromosomes are randomly dis- 
tributed in the nucleus of a cell during mitosis is translated into a geometrical 
probability problem regarding the expected number of intersections of n pairs of 
chords in a circle. Naturally, the expected number depends on the randomization 
model for obtaining chords in a circle and we develop six models yielding six 
solutions. 

The Bertrand paradox led to proposals by Poincare and others that probability 
statements for geometric situations be tied to densities that would be invariant 
under appropriate transformations. For our purposes, the group of rigid 
motions, that is, transformations that provide invariance under translation and 
rotation will serve our interest. For the Bertrand paradox, the appropriate 
density under the group of rigid motions leads to the solution that the probability 
is 2. The other two solutions induce densities that are not invariant under the 
group of rigid motions. 

One of the prime developers of integral geometry and its consequences for 
geometrical probability is L. A. Santalo. We will call on his developments and 
results for questions of invariance of measure and density where appropriate. 
Santalo has a prolific and prodigious output that is referenced in his most recent 
book (1976). That volume is must reading for any student of the subject for its 
exposition of results to date and the extensive bibliography it contains. 

The Buffon needle problem. A needle (line segment) of length / is dropped "at 
random" on a set of equidistant parallel lines in the plane that are d units apart, 
l^d. 

Uspensky (1937) provides a proof that the probability of an intersection is 
p = 2l/{ird). He develops this by considering a finite number of possible posi- 
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tions for the needle as equally likely outcomes and then treats the limiting case 
as a representation of the problem. This includes a definition of randomness for 
the distance x of the needle's midpoint to the nearest line and the acute angle <p 
formed by the needle and a perpendicular from the midpoint to the line. The 
solution is obtained by computing the ratio of favorable outcomes to the total set 
of outcomes and passing to the limit. 

A simple way of obtaining the answer is to employ a density that seems 
intuitively satisfactory and which turns out to be the invariant density under the 
group of rigid motions. This approach follows. The measure of the set of total 
outcomes is 



(1.1) 



/• 7T/2 

J 



d/1 



dx dip = — . 



From Fig. 1.1 we evaluate the measure of the set of favorable cases (inter- 
sections) as 



(1.2) 
therefore 



t/2 



(1/2) cos v 



dxd<p=-; 



(13) = //2 - 2/ 

Trd/4 Trd 




Fig. l.l 



Laplace extension of Buffon problem. Consider two sets of parallel lines over 
the plane where one set is orthogonal to the other. We now wish to find the 
probability that the needle dropped at random intersects a line of the grid. 
Assume the needle is shorter than the smaller sides of the congruent rectangles 
formed over the plane. This is the Laplace extension. First we find the proba- 
bility that the needle is contained in one of the rectangles of the set. 

Let a, b be the sides of the rectangle that contains the midpoint of the needle 
whose length is / (l<a, l<b). The position of the needle is determined by the 
coordinates x, y of its midpoint and, as before, the angle <p formed by the needle 
with the x-axis. Our randomness model suggests we consider x, y, (p as three 
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independent variables each with uniform distribution of probability over their 
ranges O^xSa; OSy^ft; and -tt/2^<p^tt/2. Thus the domain is a paral- 
lelepiped for a uniform distribution of the point x, y, <p. 

The volume of the domain representing positions of the needle entirely within 
the rectangle is 

rn/2 

(1.4) V*=\ F(<p)d<p = irab-2bTT-2al + l 2 



r(<p)U<p = TTUD — LOTT — LUl T ' 
-tt/2 

where 



(1.5) F(<p) = ab—bl cos <p — a/|sin (p| + 2/ 2 |sin 2<p\, 

and the volume of the total domain V = nab. This is developed in Uspensky 
(1937, p. 255). 
Therefore 



(1.6) i-p = — =1- 



V* 1 2l(a + b)-l 



2 



V irab 

and of course, the probability for the needle to intersect the perimeter of one of 
the rectangles is 

If a = 6 = 1, 



(1.8) 



4/-/ 2 



This provides another approach for estimating ir where / < a, l<b. 

In an interesting article Schuster (1974) develops the orthogonal lines grid a 
bit further from the point of view of experimental design. He raises a question 
about the bright student who drops a needle of length L on a grid of orthogonal 
lines separated by distance 2L and repeats it until 100 observations are made of 
intersections with, say, lines parallel to the a: -axis and these same drops are 
employed to count intersections with, say, lines parallel to the y-axis. How does 
the estimate of tt from this experiment differ from that obtained by the average 
student who drops the needle 200 times on a grid of parallel lines separated by 
distance 2L. If the x intersections and y intersections are independent, the bright 
student has accomplished the same purpose with half the drops. 

Let A be the event-intersection with the x axis and let 

fl if intersection with the x axis, 
x = \ 

[0 otherwise. 

Similarly for B and the y axis and let 

\\ if intersection with the y axis, 
otherwise. 
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We now examine P(AB) and P(A) ■ P(B). From the original Buffon result we 
have P(A) = P(B)=\/7r. We can deduce P(AB) from the Laplace result as 
follows: 

(1.9) P(AB)=\-P(AB')-P(A'B)-P(A'B') 

and we have shown 

(uo, w . in . 1 -2S«±»hl!. 1 _«a^_ 1 _-L. 

Trab 41 tt 4tt 

Thus 

(1.11) P(AB)=l-(l-~)-P(AB')-P(A'B). 

V 4tt/ 

Now 

(1.12) P{A) = - = P(AB) + P(AB'), 

P(B) = - = P(BA) + P(BA'), 

TT 

P(AB') = P(BA') =—-P(AB), 

TT 

P(AB) = -^—- + 2P(AB), 

4tt tt 

2 1 1 
P(AB) = — — = — . 

tt 4tt 4tt 

In fact, then P(AB') = P(A'B)=3/(4tt). However, we now have 
PiAB)^ P(A)P(B) and so there is no independence. 
Yet we are still interested in the efficiency of 

1 100 

(1-13) P=— X (x. + y,). 

ZUU i = i 

Let us find Var (x t + y,): We have 

Var (xj + y,) = Var x, + Var y, + 2 Cov x t yi, 

(1.14) Cov Xi y, = £(;c,y,) -£(*,- )£(y,), 

Cov x,y, = P(Xj = 1, y t = 1) . 

77 77 

From the previous result, P(x, = 1, y* = 1)= 1/(477), 

1 1 tt-4 

(1.15) Covx,y,=- i = -r^<0, 

477 77 477 
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(1.16) Var(*,- + y,.) = -(l--) + (-)(l- 1 ) + 2 -— - 2f --^ 

7T \ 77V \Tr/ V 7T/ 147T 7T J Z7T 

for Var x, = Var y< = (1/tt)(1 - 1/tt). Thus 

(LI7) Var ^p5o?< 100 >(^)=S- - 000976 - 

Now for the simple Buffon problem, we have 

(1.18) p=— X x,, Varp=— 2(M)Varxj=— -11 , Varp= — — . 

Mj = l M M TT\ TTl M 

Equate Var p and Var p and we get 

217 

(1.19) -^— = 0.000976, M = 222. 

M 

Thus 100 independent observations with respect to both grids contain the 
same amount of information about P as 222 observations with respect to one set 
of grid lines. 

Schuster raises another query: Is there an angle of intersection of the grid lines 
such that independence is achieved, i.e. an angle a such that 

(1.20) P(AB;a) = P(A;a)P(B\a) = \. 

He demonstrates that 

(1.21) P(AB;a) = f(a) = {(Tr/2-a)cosa+sma}/(4Tr) 

and that P(AB; a) = 1/tt 2 when a = 0.76605 radians (43.89°). Let us find when 
P(AB) is minimized. Differentiating we find that a = it/2 minimizes /(a). Thus 
l/(47r) is the minimum value for P(AB) and it occurs when the grid lines are 
orthogonal. 

Statistical estimation procedures for tt in the Buffon model. Perlman and 
Wichura (1975) investigate a number of statistical estimation procedures for -tt 
for the single grid and the double grid and place Schuster's results and some 
earlier results by Mantel (1953) within the framework of their development. 
From Buffon we have 

(1.22) p = — = 2rd 

ITU 

where 9 = 1/tt and r = l/d. It is more appropriate to estimate 6 for we avoid 
some pitfalls in treating asymptotic variances of our estimates in this way, yet 
estimating 1/tt or ir gives us the same information. The restriction OSipSl 
imposes ^ 6 g 1/(2/-). If n independent throws of the needle, /, § / § d result 
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in N crossings in the single grid then N is binomially distributed with parameters 
«, p and 

N 
(1.23) <?, = — 

2rn 

is an estimate for 0, and E(8\)= 0. Since Af is a complete sufficient statistic for 0, 
the Rao-Blackwell-Lehmann-Scheffe theorems imply 0, is a uniformly mini- 
mum variance unbiased estimator (UMVUE) of 0. Furthermore 0, is a maxi- 
mum likelihood estimator (MLE) of 6 and has 100% asymptotic efficiency in this 
experiment. Now 

4r n n \p 

We can minimize Var (0,) by taking p as close to 1 as possible, regardless of 6, 
i.e. choose needle length / = d for the best precision in estimating 0. In this case, 
p = 20. 

(1.25) /iVar(0,)=0 2 (^-l). 

An application of the 5 -method shows that Buffon's estimator 

(1.26) 77!= 1/0, 

is an asymptotically unbiased 100% efficient estimator with asymptotic variance 

(1.27) AVar( 7 n)=77 4 Var(<? 1 ) = ~(^-l N )= — , (employ tt = 3.1416). 

n \2 I n 

This result was derived previously by Mantel (1953). 

For the double grid in Laplace's experiment (l<d) suppose we have N A 
crossings of the A lines, TVs crossings of the B lines and therefore &a = 
N A /(2rn), 6 B =N B /(2rn) have the same distributions as 0,. Schuster proposed 
the estimator 

(1.28) 62 J-*±k = *!*±N*. 

2 Arn 

If iV A and N B are independent then the efficiency of 02 is twice that of 0,; 
however the efficiency is more than doubled because N A and N B are negatively 
correlated (antithetic variables). Laplace long ago obtained the crossing proba- 
bilities 

Pa, p B , Pa B , Pab, Pa B , Pas 
and these have already been discussed. We have 

(1.29) PAB = l-4r0 + r 2 0, where / = d = 1 
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and we know p A -PB = 2r0, so 

(1.30) PAB=PA+PB+PAB-l = r 2 6 

and 

1 = Pa+ Pb + PAS - Pab = Pab + Pab + Pab + PAb + PAS ~ Pab- 
Introduce the indicator 

1 if an A-line is crossed on /th throw, 



(0 if not 
and similarly define I,(B). Then N A =-^ =1 1,(A), N B =l7=i ^(B), 

(1.31) 6 2 = -±- I [/,(A) + /,(S)]. 

The n pairs [7,(A), Ii(B)] are independent but 7,(A) and /,(#) are dependent and 
so 



(1.32) 



Var (&) = -4— [Var J,(A) + Var I,(B)+2 Cov (J) (A), /,(J3))] 
16 m 



«Ur 8 



Once again 2 has the greatest efficiency when r = \ (l = d). When r=\, 
Pab = 1-30 or OS §3 or 77- S3. The latter inequality for it arises without any 
experimentation. A little later on we get sharper bounds on tt without any 
experimentation. When r = 1, we have 

(1.33) «Var(0 2 2 ) = 2 [^-l 



and so 



(1.34) AVar(^) = ^(f-l)=H6. 

n \ 8 / n 

By doubling the grid we obtain an estimator tt 2 , that is, 5.63/1.76 = 3.20 times 
as efficient as tt\. One throw of the needle / = d on the double grid contains at 
least 3.20 times the statistical information about values of -rr as one throw onto 
the single grid. 

Perlman and Wichura now ask whether one can do better. The answer is yes 
for we shall see that a complete and sufficient statistic exists for this experiment 
but §2 is not a function of this statistic. 

Full information obtained from n throws of the needle onto the double grid 
can be summarized by the vector statistic 

(1.35) N = (Nab, Nab, N AB , Nab)- 
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Clearly N has the multinomial distribution with probabilities 

Pab. P.ab. Pab- Pab- 
PAB = l-4r0 + r0. p AB = re. 
Now 

Pab — Pab — Pa~ Pab- 
The probability distribution of N is given by (A? and n are vector-valued) 
p B (N = n) = c(n)(p AB ) n «npAB) nAB (PABT B *(PAB) nAe 

c{n) = 



(1.36) 


(1.37) 


(1.38) 


Since 


(1.39) 



(n AB )!(«.4B)!(«Afl)!(«.4B)!' 

m = Ar ~ r~. 

n.AB = n- (n AB + n AB + n A s) 

the factorization implies Nab + Nab + N ab is a sufficient statistic for 6. 

If we define N : to be the number of times in n throws that the needle crosses 
exactly / lines (/ = (), 1,2). with N = N AB , Ni = N ab +N ab , N 2 = N ab and 
X N, = n, then the sufficient statistic can be expressed as N\ + N 2l the number of 
times in n throws that the needle crosses at least one line. 

Now 

(1.40) \\ + N 2 ~B(n.p*) where p* = >n6 = {4r-r 2 )6 

(since p AB = 1 ~ \(4r - r~)6]) so N t +N 2 is a complete as well as a sufficient 
statistic for 6. By the Rao-Blackwell-Lehmann-Schefte theorems 

-. Ni + AA 

(1.41) 0, = — is UMVUE 

mn 

and being the MLE of it has 100% asymptotic efficiency in the double grid 
experiment. 

Its variance is Var (0 3 )= (8/n){l/m -6) which is minimized by l = d (n is 
maximized by I = d). 

In this case m =3, p* = 36. n Var 0, = : [l/(30)- 1] and 

(1.42) A var (#,) = -(— 1^ °' 466 

n ^3 
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Thus, the fully efficient estimator tt 3 is 1.76/.466 = 3.77 times as efficient as 
Schuster's estimator tt 2 , reflecting the fact that 2 is based on 

N A + N B = N AB + N AB + N AB + X AB 
(1.43) 

= Ni + 2N 2 

which is not a function of the sufficient statistic N\ + N 2 - 

We see that antithetic variates are fine for minimizing variance but sufficient 
statistics when they can be found are better. In the development here we learn 
that a throw of the needle {l-d) on the double grid contains not 5.63/1.76 = 
3.20 times the statistical information as in the Schuster situation but actually 
5.63/.466 = 12.08 times the statistical information about the value of ir when 
the sufficient statistic is employed. 

Button's problem with a long needle. To this point our Buffon models have 
included only those situations where the needle was not larger than the shortest 
distance between grid lines. When l>d, there can be multiple intersections. 
Problems connected with the probability of a specific number of crossings or 
moments of the distribution of a number of crossings arise. 

First we will review this situation within the context of the prior section on 
statistical estimation of the value of tt. Mantel (1953) developed a statistical 
estimator in the following way. We now view the expected number of inter- 
sections for a double grid system (equally unit-spaced) which Mantel writes as 

4/ 
(1.44) E = -. 

TT 

No basis for this statement is given by Mantel and we shortly present a more 
general development by Morton (1966) from which the expectation statement 
can be derived. Assuming n throws of the needle we can get say c, intersections 
at the /th fall; / = 1, 2, • ■ • , n and write 

4/ 



(1.45) 






TT = — 
C 


as an estimation of 




(1.46) 






41 


where c 


is the 


average 


number of intersections per fall. By the delta method we 


can get 








(1.47) 






TT 


and 






4 2 


(1.48) 






AVar ir =2 

16/ n 
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where cr 2 is the variance of the number of intersections obtained at the fall of the 
needle. 

Theoretical evaluation of cr c for large / is of interest. Let / » 1 so that certain 
marginal effects can be disregarded. These marginal effects arise from the actual 
location of the end of the line within the squares in which it falls and they would 
slightly increase the value of a c over what is now developed here but would have 
no effect on E(c). 

For any given angle 6 at which the line of length / falls there would be / sin 6 
intersections with vertical lines and / cos 6 intersections with horizontal lines. 
Thus the expected number of intersections is given by 

2 C 12 4/ 

(1.49) E(c) = -\ I(sin6 + ax8)d0 = — . 

TT Jo TT 

The expected square of the number of intersections is 

2 r" /2 

E(c 2 ) = - / 2 (sin 6 +cos df dS, 

TT J 

2 r" /2 

(1.50) £(c 2 ) = - / 2 (l+sin20)J0, 

TT J(, 



£(,")-/■(! +7). 



Thus 



(1.51) 



Here 



a 2 = Ec 2 -[E{c)f = l 2 (\+--— 2 

\ TT TT 



<x # = 7j-v(77-~ + 27r — 16)/(16rc) forlargel. 



(1.52) o-* = >/.0095/n. 



Previously f or / = d and in the single grid, o> = V.5708/n. Thus the precision in 
estimating ir from the double grid with a long needle is about 60 times as good as 
the former precision, or equivalently the information in one fall of the long 
needle here is about the same as in 60 falls of the needle in the original BufTon 
needle problem when the length of the needle is equal to the distance between 
the parallel lines. 
In the Cartesian grid development we obtain 

(1.53) a f=l^- l A, 

I TT TT 

This suggests that an estimate of tt can be made from the variation in number of 
intersections from fall to fall. Let 

(1.54) K = |f 
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where & c is the sample standard deviation of intersections per fall. Then an 
estimate of v can be obtained from the solution to 

(l-V)7r 2 + 27r-16 = 0, 

namely 



„ -1 + VT + 16(1- V) 

" = T=v • 

Let us now examine the sampling variation of -h. For any sample, V must be 
between (all N falls give the same number of intersections) and the value of V 
obtained when half the lines have the minimum number of intersections and the 
other half the maximum number of intersections. The latter will occur when half 
the falls are parallel or perpendicular to the Cartesian grid system and the other 
half fall at an angle tt/4 or 3tt/4 to the fixed grid. In this case, half of the times 
we will get / intersections and the other half of the times we will get 2//V2 
intersections (each 45° line will have 1/-J2 intersections with the horizontal line 
and //vz intersections with the vertical lines). Thus we have 

I ct =-l +-(21 ) = — / , 

i = l Z Z Z 

V = y = p ----[3 + 2V2], 

4 

Thus for V = 0, tt = 3.1231, and for V = (3- 2>/2)/4, w = 3.1752. 

This demonstrates that tt must be between 3.1231 and 3.1752 without any 
experimentation and suggests that an actual simulation will give very satisfactory 
estimates. Mantel proceeds after these developments to give the results of a 
simulation to estimate tv by each of the three methods. Each estimate is based on 
101 falls and 90% confidence interval limits are given: 

Estimate 
Buffon needle case, / = 1 2.75-3.53 

Double grid system 
Mean number of intersections 3.09-3.19 

Variation in number of intersections 3.138-3. 146 

It would be interesting to see a development which combined the ideas of 
Mantel with those of Perlman and Wichura. 

In a very recent paper, Diaconis (1976) has investigated several aspects of the 
single grid Buffon problem with a long needle. He derives the distribution of the 
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number of intersections and approximate moments for large /. The distribution is 
shown to converge weakly to an arc sin law as l/d -» oo. 

When l>d, the probability p(i) of exactly i intersections is given as follows 
where the range of the number of intersections is to [a]+ 1, a = l/d and [x] 
denotes the greatest integer less than or equal to x. Also let the angles 0, 
(0§#, S77/2) be determined by cos 0, = i/a and let <5, =(2a sin 8i/ir)-(2i6i/ir). 
Then for [a] g 2 

(2d 
p(0) = S, + l- 



(1.55) p(/)=5,-_ 1 + 8,- +1 -25i for lgi'SM-2, 
P(M-1)=S M -2-25m-i; p(M) = 8 M -i- 

For [a] = 1, the results for p(0) and p(M) above hold, and 

(1.56) p(l)=(^) + (^j-(4aiind l /7r). 

We can also write the distribution function in simple form as follows 
p (number of crossings ^i) = F(i)= 1 — (<5, ■ — 8, + \) i = 0, 1, • • • , M— 1. 
Diaconis shows also that as a = l/d -> oc, £ & 1 

M 

(1.57) Wk = I/V.- = c k a fc + 0(fl k -- V2 ) 

i=0 

where 



tfc 



{r[ 2 (/c + i)]/r[i(/c + 2)]}vV. 



As a ->oo, the moments of //a converge to Ck where / is the random variable 
describing the number of crossings of the needle. However the arc sin dis- 
tribution has moments c k and since the range of all concerned distributions is the 
unit interval, (I/a) converges in distribution to 



(1.58) 



!(V-W 0^1, 

V"7T/(1 -x 

elsewhere. 



/(*)= W/(l-x 2 ) 1/; 



Kendall and Moran (1963, pp. 73-74) list some values of p(i) and Feller (1971, 
p. 527) refers to the arc sin density for this problem. 

General extension of Button problem. A rather general extension of the 
Buffon problem is provided by Morton (1966). Morton proves an important and 
remarkable result on the expected number of intersections and distribution of 
angles of intersections for a random set of rectifiable curves in the plane. 

Let ^ be a set of rectifiable curves C\, • • • , C m and 3) a set of rectifiable 
curves Di, ■ ■ ■ ,D N of total length /« and Iq respectively. The curves fall 
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"randomly" over the area A and we assume only that (i) the arrangement of the 
two groups of curves on the area A must be independent of each other although 
the individual curves of a group may have a systematic arrangement relative to 
each other, (ii) the arrangement of at least one of the groups on A must be 
random with the random mechanism operating such that the probability of a 
specified point on a curve falling into a subarea of A is proportional to its area 
and the segment may assume any angle relative to some base line with equal 
probability. 
Then the expected number of intersections of the two groups is 



(1.59) 



E = 



2/<g/g 



Furthermore, the density of the intersection angle between the tangents of any 
two of the C Dj is 

(1.60) \smBde, OS0S7T. 

We now sketch Morton's proof. Partition each C a with points C aQ , • • • , C aya 
so that for /= 1, 2, • • • , y a , C a i can be jointed to C a ,_i by a line segment of 
length 81. Similarly partition Dp. (81 is small enough so that end effects can be 
ignored. See Fig. 1.2.) 




Fig. 1.2 
Label the line segments E a \ etc., Fpi etc. Let 

(1.61) dp] be the angle between E ai , F m or their extensions. 

In order that E ai and F m intersect, the midpoint of F m must lie within a 
rhombus with sides parallel to E ai and F m and of length 81 (see Fig. 1.3) and such 
that E ai joins the midpoints of the sides parallel to F ej . This is under the 
assumption that dp] is given. 

The area of the rhombus is its altitude times the length of a side, that is, 

(1.62) [(«)sin0|;](S/). 
Therefore via assumption (ii) 



(1.63) 
Let 



P[E ai , Fpj intersect 1 0p]] ■ 



(8lf sin flff 



Han PiY- 



1, E a ., F p . intersection, 
otherwise. 
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midpoint of F m 



Fig. 1.3 



Then the left hand side is P[I(a„ ft) = 1 1 0%]. Thus via Bayes rule, 



(1.64) 



P[0|;|/(a,-,j8 / )] = l=- 



P[/(a.-,ft)=l|fl|;]/»(flg/) 



lP[/(a„ft)=l|^;]P(^;)^r 
From (ii) we have P(0|;) is uniform on [0, ir]. Therefore, 

(1.65) %/(tti,ft)=l] = ( , 2 4\\ / f/»". =: 7 

J (SI) sin0p;(l/(7rA))d00; 2 



sin £ 



If in the limit as SI -> 0, £„, and F ej do in fact intersect, then 6%-* 6, the angle 
between the tangents of <€ and 3 measured at the point of intersection of the 
segment C a ,,--i, C„,- with the segment Dpj-i, D ej and thus the limting density 
element is 



1 ■ n 

2 sin 0, 



OS0; 



This result was developed somewhat earlier by Wolfowitz (1949) who addressed 
his paper only to intersection angles. 
Furthermore, the probability, 



P[E a: , F $l intersect] = P[I(a„ ft) = 1 1 0?;]P(0g) del) 



(1.66) 



' (Slf sin 6>g; 1 



del 



2(Slf 

ttA 
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Thus finally the expected number of intersections is 

E = — -hm 2, L y<*Pe, 

y a = number of segments that approximate C a , 

p„ = number of segments that approximate Dp, 

2 m n 

E = ~r I Sly a X 5/^ 

7TA a = i (3=1 



(1.67a) 



and 



(1.67b) 



lim Z <5/y<* = Z length of curves in set % = /«, 



lim Z Stye = Z length of curves in set 9) = l®. 



The relationship between the expected number of intersections of a fixed 
group of line segments of total length Li with a group of line segments of total 
length L 2 that fall randomly over an area A that encompases the fixed group of 
line segments has been demonstrated to be 

E = 2L i L 2 

ttA 

This relationship was developed for segments which are not necessarily linear. 
Let us review the conditions and consider only linear segments: 

(i) the arrangement of the two groups of line segments on the area A must be 
independent of each other, but the individual line segments of a group may have 
a systematic arrangement relative to each other; 

(ii) the arrangement of at least one of the two groups of line segments on the 
area A must be at random, and the randomness must be such that the probability 
of a specified point on a line segment falling into a subarea of A is proportional 
to its area and the segment may assume any angle relative to some base line with 
equal probability. 

In the Buffon needle problem, the expected number of intersections is 
equivalent to the probability that the needle of length / ^ 1 (the distance 
between the parallel lines is taken as unity without loss of generality) intersects 
the parallel lines since an intersection, if it occurs, can be with only one parallel 
line. Thus we have from the result on expected number of intersections, 

2/ 
E = P{needle intersects line} = — 

since the total length of the parallel lines, namely L 2 , is A. (Border effects can be 
made trivial by taking areas with dimensions which are longer relative to the 
distance between parallel lines). 
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If we now consider the double grid equally unit-spaced model and / 3= 1, then 

4/ 

(1.68) E = — 

and this is the assertion used by Mantel in his long needle experimental design. 
It is difficult to leave the Buffon model and its generalizations and ramifica- 
tions. For example, it can be shown that if a region bounded by a convex curve 
whose dimensions are such that the region cannot intersect two of the lines of the 
grid simultaneously is thrown "randomly" on the grid, the probability that the 
convex curve will intersect a line is given by 

(1.69) p = ~ 

77-fl 

where L is the length of the convex curve. If we view a straight line as a 
degenerate convex curve, that is, approximated by a rectangle with infinitesimal 
width and length / and thus with perimeter 2/ we obtain Buffon's result p = 
2l/(ird). However, it is interesting that a proof of the more general statement 
can be obtained from Buffon's result as follows. 

Suppose we can prove the result for any convex polygon of diameter size less 
than d. As the number of sides increases we can approximate any convex curve 
and hence the result will follow. 

Therefore, consider an n -sided polygon with sides ai, • • ■ ,a n and diameter 
less than d. See Fig. 1.4. Now if there is an intersection with a line of the grid two 
sides must cross the line (the measure of the set of outcomes with a corner of the 
polygon touching the line of the grid is zero). 



D: 



Fig. 1.4a 
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Fig. 1.4b 



Let 



p(a,) = probability side a, intersects the grid, 

piaidj) = probability sides a, and a, intersect the grid. 

Employing the above observation along with the fact that at most one inter- 
section with the grid is possible, due to the size of the polygon, we have 

(1.70) P(intersection) = £ p(a,a,). 



But 



Therefore 



£>(«.) = I p(a,a,). 



2P(intersection) = 2 £ p(fli#/) 

(1.71) = X X PM 

;=i /=i 

n 

= X P(a<). 
i = i 

Each side of the polygon can be treated as the needle in the BufTon problem. 
Picking a side determines the location of the polygon and with sides picked 
equiprobably we get: 

2/, 
/?(«,) = — - where U = length of a t . 
ird 

Thus 

(1.72) F(intersection) = I- L : = ^ 

ird TTCL 
where L is the perimeter of the polygon. 

Buffon problem in three dimensions and an application. The three dimen- 
sional situation motivating the forthcoming discussion is the estimation of the 
volume-surface ratio of cells or nuclei. This was explored and presented in two 
interesting papers by Chalkley, Cornfield, and Park (1949) and Cornfield and 
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Chalkley (1951). This index might serve to delineate some pathology, for 
example, cancerous versus noncancerous cells. The method proposed in the 
papers takes as its point of departure a remarkable result by Crofton that we 
discuss later which shows that if a line is repeatedly placed at random over a 
plane containing a closed figure, the average length of the chord intersected by 
the figure will be 

area 

(1.73) E(C)=ir : 

perimeter 

no matter what the shape of the figure, as long as its boundary is a convex curve. 
Before this result can be employed for estimating volume-surface area ratios it 
requires modifications: 

a) it must be made applicable to line segments, 

b) a simple procedure for measuring chord length must be found, 

c) it must be extended to cover reentrant figures, 

d) it must be extended to provide an estimate not only of the area-perimeter 
ratio in the focal plane under observation but of the volume-surface ratio 
in the three dimensions of which the observed focal plane is a two- 
dimensional representation. 

The first three of these modifications are accomplished by a single device. 
Consider a line of finite length, say r, and count the number of times each of the 
two end points fall in the interior of a plane figure and denote this by h for hits. 
Denote the number of times the line intersects the perimeter of the figure by c 
for cuts. Then in a very large number of throws we shall find 

(1.74) E{k) 



E(c) perimeter 

for all closed figures including reentrant ones. 

The fourth and important modification is provided by a mathematical result. 
When a line of length r is placed at random in three dimensional space contain- 
ing a closed figure, for a very large number of throws we may write 

E(h) volume 

(175) r-— — = 4 . 

E(c) surface area 

This result will satisfy condition d) since placing a line at random in three 
dimensions can be shown to be formally equivalent to placing a plane at random 
in three dimensions and placing the line at random on the resulting two- 
dimensional plane section. 

If the space contains a series of figures of different volumes and surfaces, then 

, „^ E(h) sum of volumes 

(176) ''f ^um of surface areas ' 

As in two dimensions, this result applies whatever the shape of the figure, and 
covers reentrant as well as convex figures. 
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Basic to the mathematical result is a particular definition of randomness and 
this is where we hark back to the Buffon needle problem development. Other 
definitions are possible and these lead to different results. Therefore a test of the 
method is necessary even though it was verified empirically in the Buffon 
problem. This is discussed in the papers by Chalkley et al. (1949) and Cornfield 
and Chalkley (1951). 

Assume that in throwing a line of length r at random 

a) the probability that one end point, P, will take on any position in the space 
is uniformly distributed over the space, 

b) the probability that the other end point, P', will take any position on the 
surface of the sphere with center at P and radius r is uniformly distributed 
over the surface. 

This is a generalization of the assumptions usually made in geometrical prob- 
ability in two dimensions, for example, in Buffon's needle problem. In the first of 
the two papers we have mentioned, evidence was presented that this appears to 
provide a satisfactory description of the physical process of throwing a line 
segment at random. 

First we show that 

(1.77) E(h) = ^~ 

where X is the volume of the space in which the line segment falls, n is the 
number of throws, and V is the volume in which a hit occurs. 

Let us begin with the two-dimensional problem (see Fig. 1.5). 

The endpoint, P, of the line segment will occupy with uniform probability all 
positions in the square of side X l/2 . Since the angle that the line segment makes 
with the X axis, 6, varies from to v, the point P' will occupy all but some 
corner positions in the square of side X 1/2 + 2r. 

If we consider fixed values of y and 0, the probability that the point P will fall 
inside the figure is 

(1.78) |g 

where w(y) is the width of the figure at ordinate y. Similarly the probability that 
for fixed values of y and the point P' will fall inside the figure is 

(1.79) ^#. 

If we now let y vary from to X 1/2 but hold 6 fixed, we have 



Pr(J»eF) = - 

A Jn 



1 [ XU * 

Pr (PeF) = - w(y)dy, 

A J 

w(y,6)dy. 
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Fig. 1.5 



Each integral obviously is an expression for the area of the figure, A, and is 
consequently independent of 6. In n throws, therefore, the expected number of 
hits by either the point P or the point P' is n ■ (A/X). The expected number of 
hits counting both endpoints is 



(1.81) 



E(h)=2-n-— for 2 dimensions. 

X 



A similar proof in three dimensions, with a similar diagram, will yield with X 
the volume of the cube of side X [n , 



(1.82) 



Pr (PeF) = ~^ for y,z fixed, 
Pr(/»'eF) = Ky 'l/s for y, z, 6, <p fixed, 



(where Pr stands for probability). Therefore 

(1.83) Pr(FeF) = 

Jo 



1 



1 w(y, z) 



V 



x 1/3 'x 1/3 X 1/3 ^^"x 
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- XV! 

■> 


f xl/3 1 1 w(y,z,e, V ) J J 

x in x Ui x U3 ~ d y dz ~- 


V 
X 



Pr(P'eF)- 

and so again in n throws the expected number of hits 
(1.84) 



,,„, V V 2nV 

E{h)=n x +n x=^- 



Let us now calculate E(c). Assume without loss of generality that F is 
composed of m quadrilaterals of area Si, S2, ■ • ■ , S m , i.e., S = surface of F = 
Y.T=i $•• K c < = number of cuts on rth quadrilateral 

m 

E(c)= I £(c). 

1 = 1 

In calculating B(c,-) we note that since a straight line and a quadrilateral can 
either intersect at only one point or not intersect, see Fig. 1.6, E(d) = probability 
of an intersection multiplied by the number of throws. 




Fig. 1.6 



If we consider the rth quadrilateral with surface area S„ the probability that a 
random line of length r will intersect it is the probability that the endpoint P falls 
within the above parallelepiped. Since the volume of the parallelepiped is 
S t r cos 6 this probability is 



(1.85) 



S t r cos 8 



given 
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and thus the unconditional probability is given by 



(1.86) 



S,r cos fl/(6Q 
X 



de. 



We will show that /(#) = sin 6. Assume for the moment that this is so; then 



(1.87) 
Thus 

Thus 

(1.88) 



Pr (P e Parallelepiped) = 



Si 
X 



S t r 
sin 8 cos 6 dO = ~~ ■ 

2X 



cv \ m c a cv \ v m c rnS 

E( Ci ) = —S, and E(c)^— Sl ~. 



E(h) 2n(V/X) AV E(h) V 

E(c)~ mS/(2X)~ rS ' ° T ' E(c)~~ S' 



Now let us consider the probability distribution of 6. We seek the probability 
distribution of the angle 6 formed by a random line with a fixed line under our 
assumption of randomness. To see that the density element is sin Odd let the 
fixed line be the radius of the sphere with center at P and radius r. (See Fig. 1 .7.) 



circumferential 
belt 




Fig. 1.7 
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The probability that the line with endpoints P and P' forms an angle 9 with the 
radius of the sphere is the probability that the endpoint P' falls in the indicated 
circumferential belt or 

surface area of circumferential belt 

Probability = — — — , 

surface area of sphere with radius r 

(1.89) or 

27r(Vr 2 -y 2 )- dss'mOds 
2irr 2 ~ r 

but s = r6 implies 

(1.90) ds = rdd 
and thus the desired density element 

t d8 

(1.91) f(6) d6 = sin d = sin Odd and f(0) = sind. 

r 



CHAPTER 2 

Density and Measure for Random Geometric 
Elements 



In our Buflfon discussion we have referred to the random positioning of a line 
segment in the plane and in a brief way to the similar situation for a line in the 
plane. For the solutions we have developed for probabilities or expectations of 
events, we have employed dp d6 as the appropriate density and \ A dp dd as the 
measure over some region A where p and 6 are the coordinates of the line in 
normal form. The density dpd6 and measure based on it for events regarding 
lines in the plane will lead to probability and expectation statements that are 
invariant under the group of rigid motions in the plane, that is, translation and 
rotation. For some subsequent problems on lines and line segments it will be 
found that invariance under rotation is too demanding and we will relax this 
condition. 

For the present we turn our attention to the more general question of density 
and measure for random geometric elements that leave probability and expec- 
tation statements invariant under translation and rotation. To do this we borrow 
heavily from Santalo (1953), (1976). Remarkable results due to Crofton (1885) 
fall out of the analysis as the principal applications for this chapter. 

When a point in the plane is described by its Cartesian coordinates x, y from 
an arbitrary origin, the appropriate measure for the set of points in a region A is 
given by j A dx dy and this measure is invariant under translation and rotation. 
The appropriate density is dx dy except for a constant factor and this can always 
be treated as unity. All this is accomplished by transforming the point x', y' to 
x, y by the group of rigid motions and noting that the Jacobian of the trans- 
formation is unity. 

If we wish to transform the point (x, y) to (w, v) where x = x(u, v) and y = 
y(u, v) and then get dx dy in terms of du dv we have to be somewhat formal and 
employ the exterior multiplication of differential forms or wedge product 
notions to accomplish this; see for example Flanders (1963). We will employ 
square brackets to indicate this operation, for example, dP = [dxdy] and the 
rules are: 

1) The product is equal to zero if any two factors are equal. 

2) The product is unchanged by an even permutation of the factors and is 
multiplied by (-1) for an odd permutation of factors. 

Now we return to our example where 

x=x(u, v), y = y(u,v). 
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Then 

(2.i) dx J-^^l du+ d -^^l dv , dyJJ^ju+ty^a,, 



du 



dv 



du 



dv 



and 



dx(u, t?)dy(M, u) rj n 3x(m, v)dy(u, v) 
[dxdy} = — ; [dudv]-i — — — [dvdu\ 



(2.2) [dxdy] = \^ 
I du 



du dv dV 

+two terms equal to zero, 
dx(u, v) dy(u, v) dx(u,v)dy(u,v) 



dv 



dv 



du 



du 



[dudv]. 



That is, we get the usual Jacobian. 

Let us look into an application of these ideas. Consider a convex plane curve 
K with a tangent at every point. Let O be a point inside K. (See Fig. 2.1.) 

P(x, y) 




Fig. 2.1 

From each point P exterior to K there can be drawn two tangents to K, say 
PA i and PA 2 . To each of these tangents corresponds an angle (pi and <p 2 formed 
by the perpendiculars OHi and OH 2 with the fixed direction Ox. Conversely, 
the two angles <pi, <p 2 determine the point P{x, y). We wish now to express the 
density dP in terms of the coordinates <p u <p 2 . 

Let ei, tji be the coordinates of the point of tangency A\ and x, y the 
coordinates of P. The equation of the straight line PAi is 



(2.3) 



(x -£i)cos <pi + (y -rji)sin <pi = 0. 



Likewise, the equation of the second tangent PA 2 is 
(2.4) (jc -e 2 )cos <p2 + (y -tj 2 ) sin cp 2 = 0. 
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By differentiation we get 

cos <pi dx +sin<pi dy = l(x -£i)sin <pi~(y -771) cos (pi]d(p u 
(2.5) 

cos <p 2 dx + sin <p 2 dy = [(x - e 2 ) sin <p 2 ~ (y - tj 2 ) cos <p 2 ] ^<P2- 

Since PAi is tangent to A" at A\, dr\\lde\ is equal to the slope of the line PA\, 
namely tan (77/ 2 + <p 1 ) = —cot <p , , or cos <p 1 de 1 + sin <p \ dr\ 1 = 0. Equivalently we 
get cos <p 2 ^£2 + sin <?2 dr) 2 = 0. 

Now recall that (x — e 1 ) cos <p j + (y — 17 1 ) sin <p 1 = and square both sides of the 
equation. We get 

(2.6) (x -£if cos 2 <pi + (y -rfi) sin" <pi + 2(;c -ei)(y — 171) cos <p t sin <pi = 0. 
Note that 

[(a- - ei) sin (pi + (y - t),) cos <pi] 2 = (x -ei) 2 sin 2 <pi + (y - 17 1) 2 cos 2 <pi 

-2(x-ei)(y-T7i)coS(pi sin<pi 

= (x -ei) 2 sin 2 <pi + (y - T71) 2 cos 2 <p, 
- [-(x - e if cos 2 <pi - (y - tji) 2 sin 2 <pi] 

= (x-ei) 2 + (y-T7 1 ) 2 = r 1 2 . 

Thus 

(2.7) (x-ei)sin ^i-(y-?7i)coS(pi = r t 

and similarly 

(jc - e 2 ) sin ^2 _ (y _ T72) cos <p 2 = k- 
By substitution we get 

(2.8) cos (pi dx+sin tp\ dy —t\ d<p\ = 0, cos <p 2 dx+s'm (p 2 dy — 1 2 dcp 2 = 0. 
But by exterior multiplication we obtain 

(cos cp 2 sin (pi —sin <p 2 cos ^>j)[<i;tdy] = t\h[d<pi d<p 2 ], 

sin ((p 2 - (pi)[dx dy] = tit 2 [d(pi d(p 2 ]. 

Furthermore, <p 2 -<p\ — it — id where co is the angle A\PA 2 formed by the tan- 
gents from P. Thus we have 

(2.10) dP = [dxdy] = ^ 1 ^[d<p ] d(p 2 ] or ?^dP = [d<pid<p2\. 

sin co t\t 2 

Integrate both sides of this equality over all possible different values of the 
variables — P can vary over all points exterior to K; <pi, <p 2 can vary from to 2rr. 
However, if in each position we permute <p\ and <p 2 we get the same point P. 
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Consequently, to count each point P only once we must divide by two. There- 
fore, 

(2.H) f ^dP = UlTT)(2Tr) = 2Tr 2 . 

This integral formula is due to Crofton. Note that the right hand member does 
not depend upon the convex curve K. 

If K has a continuous radius of curvature pi we can set p\d<p\=ds\, 
pi d<p 2 = ds 2 where dsi, ds 2 are arc lengths of K at A\, A 2 . Then we may write 

sin w 

(2.12) p\p 2 dP = [dsi ds 2 ] 

hh 

since pip 2 dcpi dcp 2 = dsi ds 2 and [d<pi d<p 2 \ = (sin co/(tit 2 )) dP. 
Now integrate both sides over all possible values of the variables 

(2.13) f ^ Pl p 2 dP = h> 

where L is the length of K. 

It can also be shown (Santalo (1953)) that 

(2.14) f ! ^-^(p 1 + p 2 )rfF = 2 7 7-L. 

Jpextto/f ht 2 

Straight lines in the plane. Represent any line G in the plane by its normal 
coordinates (p, 9), where p is the distance of the normal to the line from an 
origin and ^ S it is the angle formed by the normal to the line and the a: -axis. 
The equation for G can be written as 

(2.15) xcos0 + y sin0-p = O. 

The group of transformations with respect to which we seek an invariant 
measure is the same as for points in the plane, namely the group of rigid motions. 
From Santalo (1953), we get dp dd as the invariant density and thus the measure 
M(A) of a set A of straight lines, G(p, 0)e A, in the plane is defined by 

(2.16) M(A)= f dpdd. 

Up to a constant factor, this measure is the only one which is invariant under the 
group of motions in the plane. The differential form under the integral is called 
the density for straight lines and is represented by 

(2.17) dG = [dpd6]. 

If we take for coordinates of G the coefficients u, v in its equation ux + vy + 
1 = 0, it can be shown that 

[du dv ] 



(2.18) rfG = 



2 , 2\3/2 • 



(H Z +U )' 



DENSITY AND MEASUREMENT FOR RANDOM GEOMETRIC ELEMENTS 29 

This can be seen by noting that 

cos 8 sin 8 

(2.19) «=— — , « = - — • 

P P 

Suppose we represent G by its intercepts a, (3 on the coordinate axes. Then 

(2.20) q 



cos 8 sin 8 

and in similar fashion it can be shown 

(2.21) dG= Ja^Wr- 

In the Bertrand paradox, the probability that a random chord is larger than 
the side of an inscribed equilateral triangle is i when the density is dp dd. This 
suggests that the other solutions of 5 and 4 are generated by random mechanisms 
that do not provide invariant measure under the group of rigid motions. Spe- 
cifically we illustrate this as follows. Let the length of the chord depend on the 
distance from the center of a circle and not on the direction. Suppose that it has a 
fixed direction perpendicular to a given diameter of the circle and that its point 
of intersection with this diameter has a uniform distribution. For the chord to 
have a length greater than V3, the distance of the points of intersection from the 
center of the circle must be less than 5. Thus the probability is 2. Moreover, we 
can write in this case 

f(p 1 8) is uniformly distributed on the interval (0, 1), 

f(6) is uniformly distributed on the interval (0, 2ir). 

Therefore 

(2.22) f(p,6)dpd6=f(p\8)f(0)dpd6 = \/(2Tr)dpdd or, f(p,0)=l/(2ir), 

and so the density is dp dd up to a constant factor. 

Measure and density for straight lines that intersect a curve provide interesting 
and remarkable results. Let C be a fixed curve composed of a finite number of 
connected arcs with a tangent at every point. Let C have length L and define C 
by x =x(s), y = y (s) where the parameter s is arc length. 

In Fig. 2.2, G intersects C at (x, y) and forms an angle co with the tangent at 
this point; t is the angle the tangent makes with the jc-axis. The length s 
corresponding to x and y along with the angle w determine G uniquely for C 
and fixed origin O. Let us find dG in terms of 5 and &>, rather than p and 8 as 
listed in the figure. 

Clearly 

(2.23) + 7r/2 + (Tr~co) + (w-r) = 27!-, 

thus 

8 = (o+t-tt/2. 
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Fig. 2.2 

Since x, y eG 

x cos + y sin 6 -p = 0. 
Therefore, 

(2.24) dp = cos d dx+sin0dy-(x sin 6 -y cos 0)d0; 
but dx/ds = cos r, dy/ds = sin t yield 

(2.25) dp = cos 6 cos r ds + sin sin rds-(x sin 6 -y cos 0) d# 

so dp = cos (0 — r)ds — (x sin - y cos 9) dd. 

Since t is clearly a function of 5 alone dd = dw + r'{s)ds is implied by 
8 = o)+t-tt/2. 

Then exterior multiplication yields 

dG = [dp d6] = cos (o) — ir/2)[ds(dw + t' ds)] 
= sin a>[ds dco]. 
Through integration we have 



(2.26) 



(2.27) 



n(p,6)dpd0= sin u> dw ds 



where n(p, 6) is the number of intersections of the line (p, 6) with C. That is we 
must take into account the fact that every time a line G intersects C it counts 
into the left hand side. If G does not intersect C, n(p,d) = 0. Thus 
\ n(p, 6) dG — 1 |^ sin lodcods — 2L. Furthermore if C is convex then 



n(p,0) = 



if G does not intersect C, 
2 if G does intersect C 



and thus 

(2.28a) 



n(p,0)dG = 2L becomes 2dG = 2L 

GDC Jghc 
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or 



(2.28b) 



dG=L. 



Jonc 



In other words the measure of the set of straight lines that intersect a convex 
curve is its length. 

Suppose a curve C of length L is contained in a convex curve C\ of length L\. 
Consider all straight lines that intersect C\. The expected number of inter- 
sections of such lines with C is given by n, 



(2.29) 



\n(p,d)dG 2L 



\dG L,- 

We may remark that dGj\ dG is the probability that a line (p, 6) belongs to a 
small piece dG and n(p, 6) is its number of intersections with C. This shows that 
if a curve of length L can be contained inside a curve of length L\, there exist 
straight lines which cut it in 2Z./L] points at least. 




Ci with length L i 



with length L 



Fig. 2.3 



Suppose again C\ is a convex curve of length L t and let C be a chord of Ci of 
length L. Construct a rectangle C to approximate C with approximate peri- 
meter 2L. (See Fig. 2.3.) Then the expected number of intersections with C and 
hence with C, of lines that intersect C\ is given by 



(2.30) 



\c 



dG 2L 



\ G ncdG Li 



since C", C\ are convex. 



Therefore we can calculate the probability of two lines intersecting within C\ 
and this is given by 



(2.31) 



2L(p,0) dpde 



L x \dpdd' 
that is 2L(p, 6)1 L\ is the probability of a line intersecting a fixed line (p, 6) with 



2 it A 

2 
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length L(p, 0) and dp dd/(\ dp dO) is the probability of line (p, 8) e a small piece 
dG. Thus the probability that the two lines intersect is 

(2.32) -j \L(p,0)dpdd = - 1 \ dO L(p)dp = ~ 

where A is the area of C x , \ L(p) dp = A for fixed 6. Finally the expected length 
of a chord intersecting C\ is given by 

(233) $ GnCl L(p,8)dG _nA 

lone, dG L i 

This is one of the ingenious results due to Crofton that we have mentioned 
previously in discussing the estimation of the ratio of the volume to the surface 
area of a convex body. 

We now comment briefly on measure and density for straight lines in three 
dimensions. Represent a line in 3 dimensions by four parameters a, b, p, q, that 
is, 

(2.34) x-az+p, y = bz+q 
and the intersection of two planes determines a line or 

,I X ~P\ bx bp 

(2.35) y=b{ )+q= - + q. 

\ a I a a 

This representation omits the set of all lines parallel to the plane z = but these 
lines will have measure zero and hence offer no difficulty. Now set 



(2.36) m(A) = 



F(a, b,p,q) da db dp dq 



and we want m(A) as usual invariant under translations and rotations. In this 
case we find in Kendall and Moran (1963, pp. 18-19) that 

(2.37) F(a,b,p,q)=(l+a 2 + b 2 y Z 

and hence the density for straight lines in three dimensions is 

(2.38) f ad \ dP % 

and is unique up to a constant multiple. 

Pairs of points in the plane. Now we discuss density and measure for pairs of 
points in the plane. Given a pair of points Pi, Pi we can determine them 
uniquely by four coordinates (X ly Y lt X2, Y 2 )- Then it can be shown that the 
invariant measure over some region A is 



(2.39) m(A) = 



dPi dP 2 = dx\ dx 2 dyi dy 2 . 

J A 
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P2(x 2 ,yi) 



r ; cos(0--j psin(e--j 



Fig. 2.4 



Consider Fig. 2.4 where p, 8 are the parameters for the line through the points 

Pu Pi- 

We wish to write dPi dP 2 in terms of p, 6, t\, t 2 . Care should be taken in the 
interpretation of the directions of t\, t 2 . Now, 

(2.40) [dPi dP 2 ] = [dxt dx 2 dy x dy 2 ] =f(p, 6, t u h\dpd0dh dt 2 ] 
and 

(2.41) Xi = p cos 6 - f, sin 6, y t = p sin 6 + f, cos 6 for i = 1, 2. 
Figure 2.4 verifies that 



(2.42) 

Thus 

(2.43) 



x 2 - 



t 2 \ cos ( 6-—) +p sin l^ - ^") = ~f2sin 6 + p cos 0. 



dxt = cos 8dp-(p sin 6 + r, cos 6) dd - sin 6 dt t , 
dy t = sin 6 dp + (p cos 6 - f, sin S)d8+ cos 6 dt, 
and 

(2.44) [dxt dy ( ] = p[dp dd] + [dp dt t ] - t[dd dt t ] 
and 

(2.45) [dPr dP 2 ] = [dxi dy x dx 2 dy 2 ] = \t 2 - h\[dp dd dt, dt 2 ] 
via exterior multiplication. Thus 

(2.46) [dP,. dP 2 ] = \t 2 -t x \[dGdt x dt 2 ] 

and the measure for pairs of points is a function of the distance between them in 
terms of the new coordinates. 
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Now let A" be a convex curve of length L and regional area A. Denote by 
a{p, 6) the length of the chords determined by the straight line of (p, 6) on K. 
(See Figs. 2.5 and 2.6.) Define I„ =\ a" dG where n is a positive integer and the 
integral is taken over all straight lines G which cut K. From before we have 



(2.47) 



h=\ dG = L, /, = 



dG 



adpdd = it A. 



If r(Pi, P 2 ) is the distance between Pi, P 2 for any pair of points Pi, P 2 eK then 
define 



(2.48) 



/„ = r" dPi dP 2 . 



Thus we write 
(2.49) 



h= \h-tx\ dGdi x dt 2 . 



Fixing p, we see that 8 determines a chord of length say a = b-a as in Fig. 
2.6. Fix t\ and integrate t 2 over [a, b] and we have 

r *b 



/„ = 



dGdfi f (f2-M" +1 *2 
-■''1 



+ 



(h-t 2 T +l dt 2 



dG 



(2.50) 



n+2 

= (n + 2)(n + 3)\ (b - ar3 

2 



({b-h) n+2 + {h-a) n + 2 )dh 



dG 



(n+2)(n+3) 



a n+3 dG 



(«+2)(« + 3) 
This produces the relations 



/„ +3 . 



/„=- 



(2.51) 



(« + 2)(«+3) 
„ w(«-l) r 

in — I -/fi-37 



/„+3, nS-1, 



ng2. 



Then we have the following 



1-,= 



a~ dG = 



f rfPi ^P 2 



= /-!, 



(2.52) 



h = 



a dG = 3/o = 3 



dPi 



dP 2 = 3A\ 



This last result valid for any convex curve is another remarkable result due to 
Crofton. 
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Fig. 2.5 




Fig. 2.6 



Let r be the mean distance between a pair of points belonging to K. Then 

\rdP x dP 2 _Jx U 



(2.53a) 



\dP x dP 2 A 2 6A 2 



or 



h = 6rA 2 . 



(2.53b) 

There are relationships between the /„ and /„. For example 



(2.54a) 

or 
(2.54b) 



/o- 47,^0 



\AttA, 



the classical "isoperimetric inequality". The equality holds only when K is a 
circle. Other inequalities due to Blaschke and Carleman respectively are 



(2.55) 



(15) 2 7r 8 / 4 2 -2 16 / 1 5 g0, 



(\6) 2 I?-3 2 tt*i1^0. 



In both cases equality holds only for the circle. 
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Suppose we consider circles only and let K be a circle of radius R. Then 
Io = L = 2itR, 



(2.56) 



/, 


— TtA = 77 


2 R 2 


, 




h 




16 

3 


n'R 2 

2 ' 
77 


_ 1677 3 

" 3 *' 


n 


2 16 i? 

~ K 2 8~ 
15 77 


2 8 
"15 


ir 5 R 5 

4 " 
77 


25677i? 5 
15 



Hence for any pair of points belonging to the circle, 

\(l/r)dP 1 dP 2 h 1677i? 3 16 



(2.57) e(^j = 

(2.58) E(r) = 



\dP x dP 2 A 1 3,iT 2 r 4 3t7R' 

\rdPidP 2 I A 256ttR 5 128R 
\dP x dP 2 ~ 6A 2 ~ 6- 15tt 2 R 4 ~ 45t7 ' 



The last two expectations arise in statistics from time to time and have been 
developed in different ways. 

We now offer a brief presentation of measure and density for planes in three 
dimensions (Santalo (1953)). A plane is specified by three parameters, p, 0, <p 
(the usual spherical coordinates), where p is the distance from the origin to the 
plane, 6 is the angle the projection of p on the xy -plane makes with the x -axis, <p 
is the angle between p and the z-axis. The ranges are 

(2.59) 0^p<oo, 0^0^277, 0S(pS77. 
The equation of a plane with this representation is given by 

(2.60) x sin 9 cos <p + y sin 6 sin <p + z cos = p. 

If we again seek an invariant measure under translations and rotations we will 
obtain the density 

(2.61) / (p, 6, tp) dd dp dtp = sin 6 dO dp d<p 
and if A is a subset of the parameter space, 



(2.62) m(A) = 



sin 6 dB dp dtp 



is an invariant measure for planes in three dimensions. If the equation of the 

plane is written as 

ux + vy + wz — 1, 
(2.63) 

-00<M<00, -00<t)<00, -00<W<00 

we can express the density in terms of u, v, w since 

.- ^ „ . sin cos tp sin 9 sin <p cos 9 
(2.64) u = -,- v- , w = . 
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By exterior multiplication we obtain 

(2.65) sin d[d& dp d<p] = — 2 5 ^rldudvdw]. 

(u +v +w ) 

This representation will be helpful when we look at the Sylvester problem in 
three dimensions. 
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CHAPTER 3 

Random Lines in the Plane and Applications 



In a previous chapter, we provided a proof by Morton (1966) that for a set of 
rectifiable curves contained in a plane and separate set of rectifiable curves 
constrained to be at random on a particular region of the plane, the expected 
number of intersections between the two sets of curves is given by 

(3.1) £ = ^T 

where L c is the total length of the rectifiable curves Q, i = 1, 2, • • • , «,, con- 
tained in the plane, L D is the total length of the rectifiable curves D h j = 
1, 2, • • • , n„ placed at random in the plane, A is the area of the region of the 
plane which is as defined previously; that is, it is the set of points of distance h 
from any arbitrary point of A /•/ S h § r G , where r c is the radius of the circle 
centered so that it circumscribes C and r t is the shortest distance from the center 
to C. This condition dictates intersections. Also the angles, 6, between the two 
sets at their points of intersection are distributed as 5 sin 6. 

This is a fascinating statement and now we examine the number of inter- 
sections of random lines in the plane and their angles of intersection. There are 
some earlier results for what we term the isotropic case, to be defined soon, 
by Miles (1964) who derived the expected number of intersections and by 
Wolfwitz (1949) who we previously mentioned derived the distribution of the 
intersecton angles. 

Questions of the variance, higher moments, and the distribution of the num- 
ber of intersections are of much interest. Directly related to this are the dis- 
tributions of the magnitude of areas of polygons and number of sides of polygons 
formed by random lines in the plane. We now turn to this topic. 

An initial contributor to this subject is Goudsmit (1945) who was motivated 
by some problems in physics — originally the positioning of tracks in cloud- 
chamber experiments and subsequently some other topics. To avoid difficulties 
with the notion of infinite lines in the plane, Goudsmit considered the analogous 
problem on the sphere. In that case the straight lines are replaced by great circles 
on the sphere. These great circles will intersect an arbitrarily chosen equator and 
because of symmetry we need only consider a hemisphere. Then we can study 
the distribution of the areas of the regions on the surface of a hemisphere that 
are formed by a large number of "great circles", i.e., circles formed by the 
intersections with planes through the center of the sphere. Each of these circles 
can be defined by the coordinates of one of its poles on the surface of the sphere 
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and we can then assume that each of these poles is uniformly distributed over the 
surface of the sphere. This provides the random mechanism by which N great 
circles emerge. By letting N go to infinity we obtain the analogue to an infinite 
number of line segments in the plane. Then by letting the radius of the sphere 
increase without bound we obtain the lines in the plane. 

First Goudsmit presented a simplified version of the problem which can be 
solved completely without resorting to the Sphere. Assume a plane is then cut 
into rectangular fragments. The lines in each set are distributed at random in the 
following way. Let the density of the lines be such that the x-axis is intersected 
by one line per unit length on the average and similarly for the y-axis. That is, 
the number of points of intersection is given by a Poisson random variable with 
parameter A = 1 . The problem is now reduced to that of two independent 
distributions for a number of random points on a line, one for the x-axis and one 
for the y-axis. Consider one rectangular fragment. The probability that its 
horizontal dimension is between t\ and ri+drj and its vertical dimension 
between £, and £ + d$ is given by 

(3.2) f&r 1 )d{dr 1 =ex V >[-({ + v)]dZd V 

since each length is given by the exponential distribution. 
We are, however, only interested in the area a- = £17. Let 

(3.3) /(£ n) d£d<n = exp [-/*] dp dvK^-Av) 112 

d/u, dcr 
because dij di\ = —^ — 1/2 . 
(ix -4<t) 

Note that ii 2 ~^Aa. Thus the probability that the area of a rectangular frag- 
ment is a is given by 

e'* du 



(3.4) F(o-) dcr = do- J 



j^{(jl 2 -Act) U2 - 
The expectations of powers of <r are easily obtained. 



f 
(3.5) E(a k ) = 

Jo 



Thus E(a)= 1. This value arises because the assumption of one line per unit 
length on the average has standardized these expectations in this way. 

Now we return to the general problem and consider the distribution on a 
sphere. Assume that there are N halves of great circles distributed at random on 
a half sphere, not counting the "equator". (See Fig. 3.1.) 

We can derive the following properties. The N lines divide the half sphere into 
[2N(iV + l)+l] fragments; and on the average each fragment has four sides 
when N increases indefinitely. The first statement can be proved by induction — 
the Mh line intersects all (N-l) previous lines and in doing so cuts each of N 
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Fig. 3.1 



fragments into two, thus adding just N fragments to the total. We started with 
one fragment, the whole half sphere; thus the number of fragments is given by 



(3.6) 



1 



N 
i = l 



j/V^+lRl. 



Now for the second statement. As we have already said, each of the N great 
circle halves is cut by the others into N segments. Each one of these segments 
serves as side to two adjacent fragments. In addition the "equator" is cut up into 
2N segments bordering one fragment each. The total number of sides to the 
fragments is therefore 



(3.7) 



2N Z + 2N. 



The average number of sides per fragment is 



(3.8) 



2N Z + 2N 
l(N 2 + N)+l 



and this approaches 4 as N -* <x>. 

The total surface area of the half sphere is 2ttR~ or 2tt since R = 1, and since 
there are j(N +N)+ 1 fragments, the average area of a fragment is 



(3.9) 

or asymptotically it is 

(3.10) 



l(iV 2 + N")+l 



4tt 



The total perimeter of the i(N 2 + N)+ 1 fragments is twice the total length of 
all the great circles in the half-sphere, i.e., 2ttN, plus the circumference of the 
equator; thus the average perimeter of the fragment is 



(3.11) 



2ttN + 2tt 
\{N 2 + N)+\ 
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or asymptotically it is 

and the average length of each side is asymptotically 
(3.13) ^(i)=^. 

We can now suppose that N becomes very large and consider the distribution 
of regions in a small circle of radius r on the surface of the sphere. The number 
of great circles which intersect this small circle will be asymptotically equal to rN 
(recall 0<r< 1) i.e., essentially the proportion of the small radius r to the unit 
radius multiplied by N the number of great circles. Proceeding to the limit and 
considering the region inside this small circle as approximately a plane circle, we 
find that the average area of the regions in a plane formed by random lines 
requires rescaling. This is developed now for Goudsmit's development. In the 
following sections on Poisson fields we will return to this point again. 

Let us examine this in more detail going over to the plane case from the circle 
on the sphere and letting N and the radius of the sphere become infinite. We 
must watch the rescaling or standardization quite carefully. Suppose we wish to 
adjust the rescaling so that the average area of the fragments is unity; then the 
area of the half sphere has to be N jl for large N (recall that there are 
[i(N 2 + N)+ 1] fragments, therefore 

N 2 /2 

-» 1. 



5(JV 2 + iV)+l 



The radius of the half sphere is then \NI*Jtt, (i.e., 2n(aNj^7rf = N 2 /2) and the 
length of the great circle half is (2"N/vff)7r = 2Nv7J\_The mean length of the 
segments into which the great circles are cut is thus Jn/2 and not unity as it was 
in the simplified rectangular case. 

New let us look at the mean square area of the fragments in the general case. 
Consider two arbitrarily chosen points and ask for the probability that they 
happen to be in the same fragment. This probability can be expressed in terms of 
the mean square area of the fragments. Next consider the line which is deter- 
mined by the two points and ask for the probability that the two points fall both 
in one of the segments into which the line is divided by all the other lines. This 
latter probability can be computed and thus the mean square area obtained. 

The probability that the first arbitrarily chosen point lies in a fragment of size 
between a and a + da is given by the fraction of the total area which is covered 
by such fragments, namely 

ctSG((t) da 



(3.14) 



\aSG{a)da 



where G{a) da is the density for area, S is the number of fragments in the total 
area. 
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The probability P 2 that the two points lie in the same fragment irrespective of 
its size is equal to the product of these two expressions integrated over all sizes 



(3.15) 



P 2 = 



j a SG(<t) da Ea 



[\ aSG(a) da] 2 S[Ea] 2 



We next consider the probability that the second point lies at a distance / to 
/ + dl from the first. This is given by the area of a ring divided by the total area 
(see Fig. 3.2) 



(3.16) 



2ttI dl/ total area. 




Fig. 3.2 



Through the two points we pass a line which will be cut into segments by all 
the lines already present. The chance that no intersection will occur between the 
two chosen points is exp [-//(2V77-)]. The factor vV/2 arises from the rescaling 
discussed above; the mean distance between intersections is 2 ^tt. 

The probability that regardless of their distance the two points are not 
separated by one of the lines is thus given by 

(3.17) P 2 =\277ldle' ,HO/2W ' ) /(SE(a)), P 2 = ^tt 2 /(S ■ E(aj). 

Recall that 5 • E(a) = total area, but by the rescaling we have made E(a) = 1, 
and so 



(3.18) 



P 2 = \tt 2 /S. 



Now compare with P 2 = E(a) 2 /(S ■ [E(a)] 2 ) and we get 

E(af 



(3.19) 



= 1 2 

[E{a)f I 77 ■ 



Note that this differs from the rectangular fragments case which gave the value 4 
for this ratio. 

To this point, Goudsmit is attempting to provide a random mechanism to 
produce lines in the plane and to develop some of the manifestations of this 
randomness such as the ratio of the second moment to the square of the first 
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moment. This knowledge can provide some check on the tenability of a ran- 
domness conjecture for the positioning of tracks in cloud-chamber experiments. 
Obviously knowledge of higher moments would provide more information and 
shortly we will present a derivation of the third moment employing Goudsmit's 
method of obtaining the second moment. The fourth moment and higher 
moments are not yet known. 

First, however, we characterize a class of random lines in the plane that is 
unaffected by choice of origin or coordinate axes. It will turn out that Goudsmit's 
randomness mechanism is essentially leading to this class of random lines which 
is termed the Poisson field of random lines in the plane. Another name we shall 
employ is isotropic lines to distinguish it from some extensions when sub- 
sequently we view highway traffic flow models from some anisotropic line 
models. 

Poisson field for random lines in the plane. Any line in the (x, y) plane can be 
represented as 

(3.20) p=xcos0 + y sin (9 (-oo<p<oo, 0^d<n) 

where p is the signed length of the perpendicular to the line from an arbitrary 
origin, O, and is the angle this perpendicular makes with the x-axis. Note that 
if the intersection of the perpendicular with the line is in the third or fourth 
quadrant, p is taken to be negative. A set of lines {(p*, &):/' = 0, ±1, ±2, • • •} are 
a Poisson field under the following conditions. 

1. The distances • • • Sp- 2 = P-i=Po = Pi = P2 = - ' • of the lines from an arbi- 
trary origin, O, constitute the coordinates of the events of a Poisson process with 
constant intensity, say t. Thus, the number of p, in an interval of length L has a 
Poisson distribution with mean tL. 

2. The orientations 0, of each line with a fixed but arbitrary axis in the plane 
are independent and have a uniform distribution in the interval (0, it). 

This is a reasonable representation of random lines in the plane and we label it 
the Poisson field. Note that for a fixed line segment, the intersections are 
uniformly distributed over a range equal to the length of the line segment and 
thus have density proportional to dp leading to a joint density proportional to 
dp dO. This definition of randomness for lines in the plane is natural because the 
randomness is unaffected by the choice of origin or x-axis since dpdO is the 
invariant density under rotations and translations. 

One basic feature of randomness of lines generated by a Poisson field is the 
following result. The number of random lines intersecting any arbitrary line 
segment of length x is Poisson distributed with mean 2tx/tt. To prove this result 
we note that the randomness of the lines in the plane is not affected by rigid 
motion transformations and thus we can choose the origin and the axis orien- 
tation so that the line segment corresponds to the horizontal axis from -x/2 to 
x/2. Let N x denote the number of lines intersecting the x-axis from -x/2 to x/2. 
Then we wish to show 

(3.21) P[N x = n] = e~ 2n/1r (2Tx/ir)"/n\. 
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Let N p denote the number of random lines whose signed distance, p, to the 
origin is between — x/2 and x/2. Then 

(3.22) P[N x =n]= f P[N x = n\N p = m]P[N p = m]. 

Clearly no line can intersect the x-axis between —x/2 and x/2 unless its mini- 
mum distance to the origin is between -x/2 and x/2. Thus N p must exceed N x ; 
that is, 

(3.23) P[N X = n\N p = m] = iom>m 
and therefore 

(3.24) p[N x = n]= I P[N x = n\N p = m]P[N p = m]. 

Let n =P[N X = 1 \N P = 1]. Then since the random lines are independent 

(3.25) P[N x = n\N p = m] = ( m ^ n (l-n) m '" form^n. 

By the Poisson field notion with intensity r for random lines in the plane, we may 
write 



(3.26) 




P[N p = m] = e' rx (Tx) m /ml 




Thus 










P[N X -- 


°° YYl ' P 

~n\ L ., ../u. (1 jO 
m =n «!(m — «)! 


(tjt) 


(3.27) 




(fj-rx)" „ MTX 


where 



since 

n 0fi ^ v ( 1 ~m) <! (ta:)'' ~ [(I-m)tjc] 11 tx _ mtx 

(3.28) 1 — = 1 — = <? 

<c=o /c! /t=o /c! 

Now we evaluate /la, the probability that a line whose minimum (signed) 
distance to the origin is between —x/2 and x/2 will intersect the x-axis between 
-x/2 and x/2. Let r? denote the ^-intercept of the line in Fig. 3.3. Then 
17 = p sec 6 and 

/i = P[|rj|2iJc/2||p|sx/2] 

(3.29) =P[-Jc/2Spsec0gjc/2||p|gx/2] 



= 2 



P[-x/(2p) g sec § x/(2p)|p] rfp/x 
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where the last expression takes into account the symmetry about the origin of 
the density dp/x of p given th at |p|g x/2. The density of is dO/w. Let y = sec 6. 
Then dy = tan 0sec 6 dO = yVy -\d0 and the density of y is dy/(yVy 2 - 1). 
Thus 



F[-jt/(2p)^sec(?s JC /(2p)|p] = if 

7T J 



*/(2p) 



dy 



(3.30) 
Hence 

(3.31) 

and we have 
(3.32) 



cos 



■x/(2 P) yv 
i /2p' 



y 2 -i 



77 



^ / 



n 



x/2 



cos 



7T 
= 2/7T 

P[N x = n] = e 



■©©* 



" 2 TJt/ 7T 



(2tx/tt)7«!. 




Fig. 3.3 



The number of random lines intersecting disjoint sections of a line are clearly 
independent. Thus the points of intersection of the random lines with any 
arbitrary line constitute a Poisson process of intensity 2t/tt. This result without 
the development appears in Miles (1964). 

Previously we derived some of the statistical properties of the polygons into 
which the random lines divide the plane. When the number of random lines is 
finite or the area of the plane under consideration is finite, difficulties arise with 
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edge effects and infinities. These problems can be avoided by considering a half 
sphere on whose surface are random great circle arcs whose poles are uniformly 
distributed over the surface of the half sphere. As the radius of the half sphere is 
increased toward infinity the random great circles become random straight lines 
in a Poisson field. By considering random great circles and the fragments into 
which they divide the surface of the half sphere and letting the radius of the 
sphere increase to infinity, we have obtained properties of random lines in the 
plane, namely the mean area, mean perimeter, and mean number of sides of the 
polygons into which the plane is divided. 

As we let n increase toward infinity we must also increase the radius of the 
sphere in such a way that the limiting density of lines in the plane is r. Since the 
points of intersection on any line in the Poisson field of lines in the plane 
constitute a Poisson process of density 2r/7r, the mean separation between 
intersections is tt/(2t). If R is the radius of the sphere, the mean separation 
between intersections on a great circle is TrR/n. Thus we should choose R to be 
\njr so that the limiting density (as n -* oo and R -> oo) of the lines in the plane is 
r. Then the results we obtained before (from Goudsmit) can now be written in 
the following way: The surface area of the half sphere then becomes 2-nR 2 = 
2im 2 jj 2 . Thus as n ->oc the fragments on the surface of the half sphere become 
polygons in the plane and the average area of the polygons is tt/t~. 

The average perimeter of a fragment can be obtained in a similar way and it is 

(3.33) 7 t in 2 +n ^ 

j(n +n)+\ 

which as n -»oo becomes 2 77/7. Consequently the average length of a side of a 
polygon, i.e., the average length of the segments into which the random lines are 
cut, is 2ir/r. 

Henceforth we shall adopt the letters N, L, and A to denote the number of 
sides, the perimeter, and the area, respectively, of a polygon. Using the fact that 
as n -»oo the average values of N, L and A approach the mean values, we have 
obtained: 

(3.34) E[N] = 4, E[L] = 2it/t, E[A] = tt/t 2 . 

If one wanted to test whether a given set of lines in the plane were, in fact, a 
Poisson field, it would be desirable to have second moments as well as these 
mean values. The value of £[A 2 ] was first derived by Goudsmit. His approach 
consists of using two different methods to evaluate the probability that two 
random points r { and r 2 lie in the same polygon. One method results in an 
expression involving £[A 2 ] and the other method results in an expression which 
can be explicitly evaluated. The first method is as follows: 

Let h(a) be the probability density of the random variable A i.e., \°* h(a) da, 
P[cri < A < <t 2 ], the fraction of all the polygons which have areas between er { and 
c 2 . For small da the fraction of the number of polygons which have areas 
between a and a + da is h(a)da. Let M be the total number of polygons in 
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some large area D. Then for large M, Mh{a) da is approximately the number of 
polygons with areas between a and a + da, and aMh (a) da is approximately the 
total area of such polygons. Then the probability that ri lies in a polygon with 
area between a and a + da is (asymptotically as M-*oo and D-*co) 
aMh(a)da/D. 

The probability of the second point, r 2 , lying in the same polygon is a/D. Thus 
the probability of both points lying in the same polygon irrespective of its size is 
the product of the above expressions integrated over all sizes: 

r°° 

(3.35) M\ a 2 h(a)da/D 2 = ME(A 2 )/D 2 . 

The second method of evaluating the same probability is as follows. The 
probability that the second point lies at a distance x to x+dx from the first is the 
area of the ring of width dx divided by the total area, or 

(3.36) iTTxdx/D. 

The probability that no line will divide the two points is the probability of no 
intersection on the line segment joining the two points, namely e~ 2rx/n . The 
probability that irrespective of their distance apart the two points are not 
separated by one of the random lines is 

(3.37) f 2TTxe~ 2rx/rr dx/D = l-n 3 /(T 2 D). 
Jo 2 

Hence, equating the two expressions for the probability that two random points 
lie in the same polygon, we get 

E(A 2 ) = ±7t 3 D/(t 2 M). 
As D and the number of lines approach infinity 

(3.38) D/M^E(A) = tt/t 2 and so E{A 2 ) = )-n //. 

Higher order moments. A variation of this basic method can be used to obtain 
moments involving A 3 . Consider three points, r x , r 2 , and r 3 , which are randomly 
placed in some large domain D. Let the random variable P be the perimeter of 
the triangle formed by the three points; i.e. 

(3.39) P = \\r 1 ~r 2 \\ + \\r 2 -r 3 \\ + \\r 3 -r i \\. 

We shall average an arbitrary function g(p) over all triples r\, r 2 , and r 3 that lie 
within a common polygon. First we derive the probability density function f(p) 
of p. 

f(p)bp = ?r[p<PiSp + A p ] 

(3.40) rP/2 

FT[p<P^p + \p\\\r 1 -r 2 \\ = x][2vxdx/D). 
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(Here we use Pr for probability to avoid confusion with perimeter P.) Given that 
\\ri - ri\\ is x, the locus of points r 3 such that P is exactly p is an ellipse with major 
and minor axes 



(3.41) 



1 



1 



a(p,x) = -(p-x), b(p,x) = -(p -2xp) 



1/2 



Thus, given that ||ri - r 2 \\ is x, the probability that r% (randomly placed on the area 
D) will result in a value of P between p and p + Ap is the difference (divided 
by D) between the area of the ellipses with axes a(p + Ap, x) and b(p + Ap, x) 
and with axes a(p, x) and b(p, x); i.e., 



(3.42) 



[7ra(p + Ap, x)b(p+Ap, x)-tra{p, x)b(p, x)]/D. 



Substituting this expression for Pr [p<CPSp + Ap|||ri-r 2 || = x] in our equation 
for f(p) Ap we can write 



f(p) 
= 2( 

(3.43) 



2(n 2 /D 2 ) 



.(p+Ap)/2 



,p/2 

(p + Ap, x)b(p + Ap, x)xdx - a(p, x)b(p, x)xdx/Ap\ 



-2(tt 2 /D 2 ) 



(p+\p)/2 



a(p + Ap, x)b(p + Ap, x)x dx/Ap. 



p/2 



As Ap approaches zero, the last integrand on the right hand side approaches 
zero. Thus we get 

d T f p/2 1 

f(p) = 2(7T 2 /D 2 )— [j o a(p,x)b(p,x)xdx\ 



(3.44) 



= 2(tt 2 /D 2 ) 



J r 4 "* 



dpL60 210 

= (2/21)77 2 p 3 /£> 2 . 



] 



The probability that the three points lie in the same polygon is the probability 
that no random line intersects the triangle with vertices r u r 2 , and r^. We have 
shown that this probability is e~ TP/ ". Thus the mean of g(p) over all r x , r 2 , and r 3 
in a common polygon is 



(3.45) 



1 "> 2 r °° 

pit! sM' 3 <-""*- 



Proceeding in the same way as with two points we get 



M 
(3.46) — 3 E 



j g(p) d<r x do- 2 do- 3 \ = jp -jr 



g(p)p e 



3 .-rpl* dp 
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so that 



(3.47) 



g(p) da x da 2 da^ 



D 2tt 
M 21 

■E(A) ? 



3 —Tp/w 



g(p)p> e-™" dp 



21 



g(p)p e TP "dp 



2\t z 



g{p)p i e-^" dp. 



Choosing g(p) = 1, we get i?[j4 3 ] = 7vr 7 /r 6 . Letting g(p) = p we get 



(3.48) 



I (\\ri~ r 2 \\ + \\r 2 - r 3 \\ + \\ri- ri\\) da-i da 2 da 3 



2 TT 



Since \\\ ||r, - r,-|| da x da 2 da 3 = RA 3 for i*j, we get E[RA 3 ] = 2 4 77 8 /(21t 7 ). 

A third variation of this method yields some of the moments involving L. We 
randomly place two points r x and r 2 in a large domain of area D and average an 
arbitrary function g(\\ri - r 2 \\) over all pairs (r iy r 2 ) such that the points fall inside a 
common polygon and each point lies within a small distance, w, of its boundary. 

We shall consider four disjoint sets of random lines: 

5: separating lines, that pass between r\ and r 2 ; 

C: common lines, within a distance w of r x and r 2 but not between them; 

Ei and E 2 : end lines, within w of one point but not within w of the other point 
and not between them. 

The lines in S are exactly the lines intersecting the line segment of length x 
between r\ and r 2 . We have already shown that the number of lines in S is 
Poisson distributed with mean Ms = 2tx/tt. 

The lines in the union of the four sets are exactly the lines that pass within a 
distance w of the line segment between ri and r 2 . Thus they are exactly the lines 
intersecting the convex region shown in Fig. 3.4. Since the mean number of 
random lines intersecting a convex figure of perimeter p is 177/77 and the region 
in the diagram has perimeter 2x +2ttw, we have 



(3.49) 



Ms+M c + 2M B = 2t(x + ttw)/tt 



{M e denotes Me, which equals M E2 by symmetry). 

Similarly the lines in the union of 5, C and E, are exactly the lines that 
intersect the convex region shown in Fig. 3.5 which has perimeter [rrw + 
2w sin" 1 (w/x) + 2(x 2 -w 2 ) l/2 ]. Thus we have 

(3.50) Ms+M c +M E = T[Trw+2wsm~ 1 (w/x) + 2(x 2 -w 2 ) l/2 ]/Tr. 

Expanding the sin -1 term in a series and solving the last three equations, we get 

M e = tw-tw> /(irx) + o(w 3 ), 
(3.51) 

M c = 2tw 2 /(ttx) + o(w 3 ). 
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Flii 3.4 




Fig 3.5 



Now we are able to calculate the probability that two points with separation x 
lie inside a common polygon, each point being at distance w or less from the 
boundary. This probability is the sum of the probabilities of the two mutually 
exclusive events: 



(3.52) 
Thus 

(3.53) 
(3.54) 



and 

(3.55) 



W 2 : S = <i>, CV<J>. 
Pr [Wi] = Pr [5 = <J>] Pr [C = <&] Pr [E, * *] Pr [E 2 * *] 

-TW + TW /(ttX) + o(w )-|2 



_ — 2tx/tt — 2tw 2 /(ttx)+o(w ' )[ -t 

= T 2 w 2 e~ 2Tx/n + 0(w 3 ), 

Pr[W 2 ] = Pr[S = cD]Pr[C#<J)] 

= g-^^n _ e - 2 ™' 2 /('™) + o(w 3 )i 

^ T 2 -Irx/ir + Oiw 3 ) 

= — w e 

TTX 



Pr [Wi U W 2 ] = tw 2 \t + 2/(ttx)] e~ 2rxhr + 0(w 3 ). 



This is the probability that /•] and r 2 with separation x lie inside a common 
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polygon, each point less than w from the boundary. As before, the density of x is 
2nxdx/D. Thus the mean of g(x) over all r\ and r 2 in the same polygon and 
within w of its boundary is 

.00 

(3.56) {2tttw 2 /D)\ [TX + 2/Tr]g(x)e- 2Tx/,T dx + 0(w 3 ). 

Let A w denote the area of the strip of width w inside the boundary of the 
polygon. Then by the same arguments as in previous cases a second expression 
can be obtained for the mean of gflki -r 2 ||) over all r x and r 2 in the same polygon 
and near the boundary; namely, 

(3 , 7) ^Jf gfrl _ rJ|) ^]. 

For small w, da { = w ds t where ds denotes differential length along the boundary 
of the polygon. Equating our two expressions we get 

(3.58) 1^41 1 S(\\r l -r 2 \\)ds 1 ds 2 ] 

= (2ittw 2 /D) [rx +2/Tr]g(x) <T 2ti/ " dx + 0(w 3 ). 
Jo 

Letting w go to and M and D tend to infinity so that D/M-*E(A)= tt/t 2 , we 
get 

(3.59) E[JJg(||r 1 -r 2 ||)^ 1 d5 2 ] = 27r| (2r + irx)g(x) e~ 2rxU dx. 

Choosing g(x)= 1 we arrive at the result 

(3.60) E[L 2 ] = tt 2 (tt 2 + 4)/(2t 4 ). 
On the other hand, choosing g(x) = x 2 we get 

(3.61) ^[j \\\r 1 -r 2 \\ 2 ds 1 ds 2 ^=~[4 + 37r 2 ]. 

Let r L be the center of gravity of the perimeter of the polygon. Then 

(3.62) \\r x - r 2 f = \\r x - r L f 4- ||r 2 - r L \\ 2 + 2\\r x - r L \\ \\r 2 - r L \\ cos a 12 
where a i2 is the angle at r L between lines to r x and r 2 . By symmetry 



(3.63) 




J Ih-^ll IK-^lII cos a xi dsi ds 2 = 


and 




(3.64) 


w 


||ri - r L f dsi ds 2 = 1 1 \\r 2 -r L \\ 2 dsi ds 2 . 
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Thus 
(3.65) 
i.e., 
(3.66) 



IE f 



y^* w 



||ri-/ij| dsi ds 2 



E[LI L ] =— 4 [4 + 3n 2 ] 

OT 



where I L is the moment of inertia of the perimeter about its own center of gravity 
(i.e., Il = \ II' - — ill ds where r is on the perimeter). 

Some additional features of random lines in the plane have been presented by 
R. E. Miles (1964) and are repeated here without proof. 

1) For random lines obeying a Poisson field with intensity t there are on the 
average t 2 /V intersections per unit area. The associated angles of intersection 
are mutually independent with common probability density I sin <I> (0S4>S= w). 
Thus random lines can be expected to intersect each other at angles close to right 
angles much more frequently than at angles close to 0° or 180°. 

2) Let W be the diameter of the largest circle contained in a polygon formed 
by random lines as given above. Then 

VT[W^w} = l-e~~ TW , 

or W is distributed as a negative exponential random variable with mean t" . 

3) Let L/c denote the perimeter of a k -sided polygon formed by the random 
lines. Then IrLk/ir is \ w ' tn 2(fc-2) degrees of freedom. In particular, the 
perimeter of a triangle is negative exponential with mean value tt/t. A corollary 
is that the mean length of a side for a class of k -sided polygons is (k -2)ir/(kT) 
which increases to tt/t as fc^oo. Thus the average length of a side of a many 
sided polygon can be expected to be about twice the average length of a side for 
the entire class of polygons and about three times the average length of a side of 
a triangle. 

4) Pr[AT = 3] = 2-7r 2 /63=.3551. 

For 5«t~ 1 and A«t~ the probability densities of S and A are 
(12-7r 2 )T/(67r)+ 0(t 2 5) and ctA x/2 +0(t 2 ) respectively where 



1 



* 7T » 7T 

o Jo 



[2 sin <t> sin tp sin (<& + i/0] dtpd® 



5) The random variables N, L, and A corresponding to the number of sides, 
perimeter, and area of a polygon have the covariance matrix 



N 



L 



N 


(tt 2 -8)/2 


ir(7r 2 -8)/(2r) 7r ( 17 . 2 -8)/(2 T 2 )" 


L 


7r(7r 2 -8)/(2T) 


77- 2 (7t 2 -4)/(2t 2 ) 7r 2 (7r 2 -4)/(2T 3 ) 


A 


tt(77 2 -8)/(2t 2 ) 


77 2 (77 2 -4)/(2T 3 ) 7T 2 (77- 2 -2)/(2t 4 ) 
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Furthermore, 

E[NA 2 ] = 7T 4 (87r 2 -21)/(21r 4 ), 

E[LA 2 ] = 8tt j /(21t 5 ), 

E[LA m ' 1 ] = 2TE[A m ]/m (m = l,2, • • •)• 

6) Let V be the set of intersections of the random lines. Define I k as the set of 
line segments / such that / joins two points of V, there are exactly k other points 
of V on /, and / is part of one of the random lines. Similarly define J k as the set of 
line segments / such that / joins two points of V, there are exactly k other points 
of V on /, and / is not a part of one of the random lines. Thus, for example, I 
and / are the sets of all sides and diagonals, respectively, of the polygons 
formed by the random lines in the plane. If 5 is the length of a member of I k , 
then 4tS/tt is \ w ' tn 2(& + 1) degrees of freedom. If T is the length of a 
member of J k , then 4tT/tt is x with 2(k + 2) degrees of freedom. Miles presents 
similar results for random lines which have a nonzero thickness. 

The derivation of the exact distribution of N, L and A is still an open question. 
Empirical distribution functions for these variables can be obtained by Monte 
Carlo methods. Some early results from a computer simulation of random lines 
in the plane by Stuart Dufour 1 are the following: 

p 3 = Pr(iV = 3)s0.36 

p 4 = Pr(W = 4)3 0.38 
(3.67) p 5 = p r (jV = 5 ) S 0.19 

/? 6 = Pr(JV = 6)s0.054 
p 7 = Pr(A r = 7) = 0.010. 



For r = 1 



(3.68) 



Pr (AS 0.05) s= 0.13 Pr (is 0.5) = 0.05 

Pr(AS0.10)s0.18 Pr(LSl.0) = 0.11 

Pr(A=S0.25)s0.27 Pr(L§ 2.5)^0.26 

Pr (A g 0.50) s 0.38 Pt(LS 5.0) s 0.51 

Pr (A §0.75)s= 0.45 Pr (Lg7.5)s0.67 

Pr (A §1.00)30.50 Pr(Lg 10.0)30.79 

Pr (A §2.50) 3 0.67 Pr(L§ 15.0)30.92 

Pr (A §5.00) s 0.80 Pr (L§ 20.0) 3 0.98. 
Pr (A §10.00)3 0.90 
Pr (A §15.00)30.95 



1 Unpublished memo. 
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These rough empirical distributions were obtained from an analysis of 947 
polygons formed by 65 random lines. Only polygons within a radius of 32.5 were 
considered . Note the similarity between Pr (N = j) and Pr ( Y = / - 3 ) where Y is 
Poisson distributed with mean 1. 

Along similar lines the following have been transmitted by R. E. Miles in a 
private communication to H. Solomon in September 1969. 

p 3 .355065933 (exact) p 8 .00207(0) 

p 4 .3814(0) p 9 .00026(0) 

(3.69) p 5 .189(5) p 10 .000026(0) 

p 6 .0586(5) p n .0000021(5) 

p 7 .0127(5) p 12 .00000015(0). 

Miles has some suspicions about these results for he conjectures that p n ~ e n as 
n -*oo, an assumption not employed in obtaining the values listed above. Miles 
then fit the equation 

(3.70) p„ = e n + e ~ 2n (w +xn) + e 3n (y + zn) 

to the four known relations: Xp„ = l, X«p„=4, Y.n p„ = (tt~/2) + 12, p 3 = 
2 — (it /6) obtaining 



Pi 


.35507 


Ps 


.00237 


P4 


.36686 


P9 


.00047 


Ps 


.20304 


Pio 


.00010 


P6 


.05530 


Pn 


.00003 


Pi 


.01195 


Pn 


.00001 



This seems to be rather inaccurate. 

Miles also reports on two long runs of simulation of random polygons on a 
computer by a colleague wherein he lists the frequencies and relative frequencies 
as in Table 1 . 

To achieve three place accuracy, it appears that several hundred thousand 
polygons may be required! 

Analytically, the problem seems too formidable. It is possible to express p 4 as 
a complicated 3-fold integral; also for the polygon containing the origin or any 
arbitrary fixed point in the plane, p 3 = (tt 2 /6)[25- 36 log 2] = .07681381. 

Anisotropic lines in the plane: An application. It is possible to view the 
trajectory of a car produced by its time and space coordinates on the highway as 
a straight line in a plane if the car travels at a constant speed once it enters the 
highway and then never leaves the highway. We assume no change in a car's 
velocity when it overtakes another car, or is overtaken by another car. The 
Poisson process is the random device governing car entrance times or 
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Table 1 









Rel. frequencies 




i 


ii 


(combined) 


3 


3635 


14082 


.35900 


4 


3952 


15055 


.38500 


5 


1893 


7227 


.18500 


6 


586 


2163 


.05600 


7 


141 


503 


.01300 


8 


29 


98 


.00260 


9 


3 


5 


.00016 


10 








.00000 


11 








.00000 


12 





1 


.00002 




10,239 


39,135 


1.00078 



equivalently car positions, and the speed distributions for each vehicle are 
assumed to be identically and independently distributed (i.i.d.). Thus we can 
view the trajectory of any car as a random line in the plane and we will shortly 
formalize this notion. The intersections of these random lines in the plane will 
represent time and space coordinates where an overtaking occurs. 

Let one of the vehicles be an observer car (arbitrary line). The number of 
intersections of the arbitrary line (observer car) by the other lines determines the 
number of overtakings of slower cars made by the observer car plus the number 
of times it was overtaken by faster cars. We will develop this distribution and 
also the distribution of faster car overtakings of the observer car and overtaking 
of slower cars by the observer car. This requires a formulation for an anisotropic 
Poisson field of random lines from which a different proof of Renyi's result 
(1964) and the Weiss and Herman result (1962) can follow. These authors 
achieved their results without employing a geometrical probability context which 
we now develop. 

We have already formalized the notion of straight lines distributed "at ran- 
dom" throughout the plane. We will describe the plane in terms of (f, x) coor- 
dinates, where subsequently the t axis will be employed to register time of arrival 
of cars at a fixed point on a highway and the x axis will in similar fashion report 
on spatial positions of cars on a highway at a fixed point in time. The time 
invariance property for Poisson processes will insure that the conditions will 
prevail at any point in time. Any line in the (t, x) plane can be represented as 



(3.71) 



p = t cos a + x sin a, 



-oo<p<oo, 0^a<ir, 



where p is the signed length of the perpendicular to the line from an arbitrary 
origin O, and a is the angle this perpendicular makes with the t axis. Note that if 
the intersection of the perpendicular with the line is in the third or fourth 
quadrant, p is taken to be negative. Assume the set of lines {(/?,, a,): / = 
0, ±1, ±2, ■ • •} constitutes a Poisson field. 
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This definition of randomness for lines in the plane also has the property that 
the randomness is unaffected by the choice of origin or line to serve as t axis, 
since it can be demonstrated that except for a constant factor J dp da is the only 
invariant measure under the group of rotations and translations that transforms 
the line (p, a) to the line (p', a'). We now employ it as a point of departure to 
initiate discussion of a nonhomogeneous Poisson field of random lines. To 
achieve this we will relax one of the conditions and ask only that the a, be 
identically and independently distributed (i.i.d.). This will permit a range of 
velocity distributions for the auto and make the model more realistic. The 
random lines are now labeled anisotropic lines in the plane. 

For ease in the algebra of our traffic flow models, we will employ instead of a, 
an angle formed by the intersection of the / axis with a line in the plane and we 
label this 6 where v = tan 6. Also we will only be concerned with those lines 
where p, falls in the second or fourth quadrant since this will yield all positive car 
velocities. The inclusion of the p t in the first and third quadrants does not 
complicate the mathematical development, but they are not relevant. Thus, 
a =2ir + d and the lines of interest will now be parametrized by (p, 6) where 

(3.72) p = -t sind + xcosd, 0^0 <\tt. 

Equation (3.72) takes care of the sign of p, for it insures that p will be positive if 
it is in the second quadrant and negative in the fourth quadrant. 

The set if of lines {(p„ a,): i = 0, ±1, ±2, • • •} is not anisotropic and we require 
invariant measure only under translation. We look into this situation because it 
will be helpful in our traffic flow models. Under this constraint, we now have the 
same conditions except that the orientation angles a, of each line are i.i.d. 
random variables with common distribution function in the interval (0, rr). Thus, 
| dp da is no longer the appropriate measure and the origin can still be arbitrarily 
chosen but only at any point on a specific and fixed t axis because invariance is 
preserved now only under translation. 

The orientations 6 t are independent and identically distributed with common 
distribution F in the interval (0, it/ 2), and further the sequence of values {#,) is 
independent of (p,). This is equivalent to the statement that the velocities of cars, 
namely v t = tan 0„ are independent and identically distributed with common 
distribution G on (0, oo) and thus (u, = tan #,) are independent of (p,-). 

When 6 = 0, p = 0, the traffic flow is characterized by a distribution of time 
intercepts on the t axis; when 6 = tt/2, p = 0, the traffic flow is characterized by 
a distribution of cars spaced along the x axis. For any other value of 6, the traffic 
flow is measured along a trajectory line. In the traffic literature, trajectories for 
low density traffic flow (no delays in overtaking) may be assumed to be linear in 
the time-space plane. Thus in any development, we must employ the appropriate 
measure to characterize distributions of traffic flow in such matters, for example, 
as distribution of number of overtakings. For our purposes where Poisson 
processes are the underpinning for traffic flow in both spatial and temporal 
processes the evaluation of the appropriate Poisson intensity parameter will be 
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paramount as will be the relationships between these parameters for different 
measures. 

As noted above, each line is parametrized by (p, 6), where g $ < n/2, and p 
is the length of the perpendicular from the origin to the line. It is given a positive 
sign if the line cuts the t axis at time r > 0, and a negative sign if it cuts the t axis 
at time t <0. A glance at Fig. 3.6 will indicate cars faster and slower than the 
observer car. 



faster car 



observer car 




Fig. 3.6 



Label N L the number of cars intersecting the observer car before the point 
t = L cos d , x = L sin 6 , i.e., in the line segment L of the line / = (0, 6 ) stretch- 
ing from the origin to the point. Let N„ denote the number of cars with the 
opportunity to intersect the observer car on this stretch. Then N p is the number 
of cars with —L sin 8 <p<L cos O - 

Now any one of these N L vehicles actually is overtaken or overtakes, given 
its d, if 



(3.73) 



(i) 0<f?<6>o and 0<-p<L sin (O o -0) 
or -Lsin(0 o -0)<P<O, 
(ii) o <0<it/2 and < p < L sin (6 - o ). 



Graphically we show (3.73) in Fig. 3.7. 
Define /* = P{N L = 1 1 N p = 1}; then 



(3.74) 

H. = P{0 < 17 < L 1 3 exactly one random line (p, 6) with -L sin 6 < p < L cos 6 ] 
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(i) 




and 

(3.75) 



-tn: 



. tt/2 .Lsin(0-e o )-. 
+ 
'-L sin (e o -0) J(? Jo 

• dF(0, p\3 exactly one random line (p, 0) with —L sin 8 <p <L cos 6 ). 



Now the distribution of 6 is independent of the distribution of p and the 
condition depends only on p. Moreover the conditional distribution of p given 
-L sin 6 <p <L cos O is uniform on the interval (— L sin O , £ cos O ) since the 
p's are from a Poisson process. 

Therefore we may write 



(3.76a) M 



j-«o r° 

Jo J-Lsin(«o- 



+ 



W2 



Lsin («-«„) 



^p 



L cos &o + L sin O 



or 



(3.76b) ii ■■ 



1 Lsin(fl -fl) 
o L cos 6o + L sin O 



dH(6) + 



■nil 



L sin (g - ) 
e Q L cos O + L sin # 



<///(*) 



dH(0). 
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If is uniformly distributed on [0, tt/2], then dH(d) = dd/(n/2) and fj. = 2/tt as 
developed in the previous section for isotropic lines. 

The number of cars N L is the number of cars with -L sin 9 <p <L cos O . 
Because of our Poisson assumption, N L is Poisson with parameter X(L cos d Q + 
L sin do)- A random member of this set has probability p of actually overtaking or 
being overtaken and so the number of such overtaking occurrences is Poisson with 
parameter (where A is the intensity of the process of cars entering the highway) 



(3.77) 



tx\L(cos 6o + sin 6 ) = XL 



sin (00-6) dH (0)- 



rr/2 



sin (6 -8 Q )dH(d) 



«o 



Since v = tan 0, we label G(v) as the distribution of the velocity and replace 
dH(0)ml\\dG(v). Also 



(3.78) 
and 

Thus, 
(3.79) 



sin (6»o- 6») = 



sin do 



v cos 0o 

/■I , 2vl /2 ~ ~77~, 2\l/2 
(1 + V ) (1 + V ) 



sin (9 - O ) = 



V COS O 

2x1/2 



sin O 

2 -.1/2- 



(1 + iT) 1 " (1 + t/) 



f tanfl " sin 8o — v cos 9o 
ix (cos 6\> + sin d ) = — 2 1/2 — d//(i> ) 

Jo (1 + U ) 



Jtand,, (l+lO 

/u, (cos 0o + sin O ) = cos O 



v cos O - sin d ,_, ^ 



J^j2^G(v) + 



o (! + «') 



2^72^G(y)J. 



o (! + «') 



Therefore the intensity of the Poisson process generated by faster cars overtaking 
the observer car is 



(3.80) 



A' = XL cos O 



v-v 



(i + O 



m dG(v), 



2x1/2 



and the intensity of the Poisson process generated by the observer car overtaking 
slower cars is 



(3.81) 



A" = ALcos0 o 



V °~ V A^< \ 



o (HV) 



From this basic result arising from a merger of random lines in the plane and 
nonhomogeneous Poisson processes, all the models developed by Renyi, Weiss 
and Herman, and others can be derived as demonstrated in Solomon and Wang 
(1972). 
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Additional results for anisotropic random lines. In a previous section we found 
the density for lines in the plane which is invariant under translations and rotations 
of the coordinate axes. We have just seen for the highway model a much larger 
class of random lines, whose distribution is required to be invariant only under 
translations of the coordinate axes. Let us pursue this larger class in other 
contexts. Three examples of random lines are shown in Fig. 3.8. Figure 3.8(c) 
shows isotropically distributed random lines. In Fig. 3.8(a) and (b) the random 
lines are clearly anisotropic (i.e., their orientations are not uniformly distributed). 

We now examine the measure of lines intersecting a set. Much of the work that 
follows is due to Dufour (1972). 



, 



(a) 





(c) 



As before we describe the position of a line in terms of the coordinates (p, 6), 
where p is the signed distance from the origin to the line (p < if the line passes 
beneath the origin), and 8 is the angle between the x-axis and the perpendicular to 
the line. (See Fig. 3.9.) Thus the position of a random line is determined by the 



— X<p < X 

< < TT 




origin 



Fig. 3.9 
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values of the random variables p and 0. We denote by F(8) the cumulative 
distribution function of 6, and let J dF(0) denote integration over the probability 
density corresponding to the distribution function F(d) when F(8) is absolutely 
continuous, summation over the mass function when F{8) is discrete, and 
integration over the singular density when F(0) is singular. 

Our translation-invariance assumption implies that the orientation of a 
random line is independent of its distance from the origin. The translation- 
invariance assumption also implies that for any given 6 the density of p is dp. Thus 
the joint density of p and 6 is dp dF{6 ) and the measure m ( W) of all positions of a 
random line satisfying some condition W can be written 



(3.82) m(W) = 



dpdF{6) 



where the integration covers all positions of the random line such that W is 
satisfied. Note that in the isotropic case this measure becomes (I/77) dp dd, which 
differs by a factor of 1/tt from the measure usually discussed in integral geometry. 
At this point we should introduce some basic equations, which we shall use 
repeatedly throughout this discussion of random lines. Let dG denote the density 
for the positions of a random geometrical figure K a (e.g., a random line). Denote 
by m(W) the measure of all positions of K a such that some condition W is 
satisfied, i.e., 

(3.83) m(W)= \ dG. 

Jw 

If AT is a random variable depending on the position of K a , the mean value of X 
given that K a satisfies W is 

\ w XdG 



(3.84) E{X\W) = -- 



m(W) 



where the integration is taken over all positions of K a such that the condition W is 
satisfied. In particular, if we have two conditions W x and Wo such that W x implies 
W , then if we define 

_ J 1 if Wi is satisfied, 
[0 otherwise, 

we find that the probability that K a satisfies W u given that it satisfies W , is 

m{W x ) 



(3.85) P[W X \W Q ] 



m(W Q y 



Returning to random lines in the plane, let us calculate the measure m (K) of all 
positions of a random line such that it intersects a (measurable) set K in the plane. 
Define the thickness T(0) of the set K in the direction 6 to be the length of the 
projection of K onto a line with direction 0. (See Fig. 3.10.) Then the integral j dp 
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Total length T(0) 




Fig. 3.10 

of all positions of a line with perpendicular direction 8 such that it intersects the set 
K is T(8). Thus 



m(K)- 



(3.86) 



T(0)dF(e) 



■ E(T) 



where the notation £'(7') denotes the mean thickness of the set K in a direction 
perpendicular to a random line. 

Let us consider some simple examples of sets K and distributions F{6). First we 
give expressions for the thickness T(0) in the direction of five simple convex 
regions for which T(6) is easily obtained geometrically. 



(3.87) 



(a) r o (0) = 2ro, 

(c) T o (O)=Si\cos9\ + S2\sin0\, 



s cos 0, 



s cos [6 








-) 


77 „ IT 


i) 


6 2' 


2tt\ 

T/' 


71 ^ ~ 57T 



(b) r„(0) = Jt|cos(0-<p)|, 

(d) r (fl) = r(l + |cos«|), 

5tt 

6 



S; (/ S; 7T, 



(e) r (e) = 



See Fig. 3.11 (a)-(e) for diagrams. 

For the circle of radius r, the thickness is a constant, It. Thus the mean 
thickness is 2r, and the measure m(K) of the positions of a random line 
intersecting the circle is 2r regardless of the distribution F(d) of the orientation 
of the random line. 
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<a) 



(e) 





(hi 




(J) 



(e) 
Fig. 3.11 

Note that for a connected set that is not convex, the thickness T(9) is the same 
as the thickness of the convex hull of the set. Thus, for example, the thickness 
T(6) of two adjoining line segments of length .S making an angle of tt/3 is the 
same as the T(6) given above for the equilateral triangle. 

As an example of F(0), consider the probability distribution 



(3.88) 



P[e = o] = - 



With this distribution, the mean thickness of a half circle of radius r, for 
example, is 



(3.89) 



E(T) = hr(l+cosO)]+l[r(l+cos7r/2)]-- ^ 



2 l x " V v " 2 

As a second example of F(0), let us consider the isotropic distribution 



(3.90) 



dF(e) = -d8, 

77 



OSflSTT. 
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Suppose K is a connected set. A line intersects K if and only if it intersects the 
convex hull of K. In a previous section we showed that if G U K is the set of lines 
intersecting a convex region with perimeter L, 



f j dp, 



(3.91) dpd0 = L. 

Since the integral f dp over all lines with perpendicular direction 8 is just T(8), 
the above relation can be rewritten 



(3.92) 



T(8)d8 = L. 



Thus with respect to an isotropic random line, the mean thickness of a convex 
region of perimeter L is just 

(3.93) E(T) = L/n. 

Hence, for an arbitrary connected set K on the plane, m(K) is tt times the 
perimeter of the convex hull of the set. 

We shall now obtain expressions for the probability that a random line 
intersects a set K x given that it intersects another set K t) . It is necessary that we 
consider conditional probabilities of this sort since the unconditional probability 
that a line, random over the whole plane, will intersect any bounded set K x is 
clearly zero. We shall consider three cases: one where K] is a subset of K a , one 
where K u and K } have a nonnull intersection, and one where K and ATi are 
disjoint. 

For the case where K x is a subset of K , we need only substitute our expres- 
sions for the measure m(K) to obtain the following: 

A random line intersects a measurable set K . The probability that it also 
intersects a subset K\ of A"„ is 

(3.94) /'[line hits K, \ hits *<>]=§— 

where E(T t) ) and F(Ti) are the mean thicknesses of K () and K\, i.e.. 



(395) E(T,)= \ T,(8)dF(d\ i = 0, 1 . 

Jo 

An isotropic random line intersects a connected region K - It intersects a 
connected subset K x with probability 

L* 

(3.96) Ffline hits A', | hits AT ( ,1 = yi 

where L* and Z.f are the perimeters of the convex hulls of K and Ku respec- 
tively. In particular, if K„ and A"i are convex with perimeters L and L\, 

(3.97) P[line hits K x | hits K ] = --. 
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Let us now consider the case where Ki and K have a nonnull intersection, but 
K\ is no longer necessarily contained in K . We shall assume further that K and 
K\ are connected sets, since without this assumption the problem seems to be 
intractable. Given that a random line hits K , the probability that it also hits K\ 
is 

,-. «^n „r,. ,■ ,, i, • „ t m(AT and.Ki) 

(3.98) Pfhne hits #1 1 hits ^oH— 1 - 2 -; — r~ - 

m(K ) 

where m(K and iiTi) denotes the measure of all lines intersecting both K and 
K\. The measure m(K or Ki) of lines hitting A" or K x can be written 

(3.99) m(K orK 1 ) = m(K ) + m(K 1 )-m(K andK l ). 
Thus 

m (/JT ) + m (Ki) - m (K or K x ) 



(3.100) Pfline hits K l \hitsK ] = ■ 



m(*o) 



and we have the following: 

Let K and K x be the intersecting connected sets on the plane. A random line 
which intersects K also intersects K x with probability 

(3.101) P[lme hits K x | hits K \ = — — 

E(lo) 

where E(T ), E(T X ) and E(T 0l ) denote the mean thicknesses of K , Ki and 
K U K\ , respectively. 

Note that when Ki<^K , E(T i) is E(T ) and the probability given in this 
theorem reduces to that given before. 

For an isotropic random line 

(3.102) P[line hits /STi| hits i<: ] = — — -^ - 

where L*, L*, and L*i are the perimeters of the convex hulls of K , K x and 
K UKi, respectively. 

Now consider the case where K and K\ are disjoint, connected sets. A line 
intersects both K and K x if and only if it intersects both of the shaded regions 
(KoUQf and (K x U Of pictured in Fig. 3.12. 

The point Q is the intersection of two lines just touching the edges of K and 
K\ and crossing between them. The sets (K U Q)* and (Ki U Q)* are the convex 
hulls of K U Q and Ki U Q, respectively. Thus 

m(K and Ki)=m((K \JQ)* and {K x U Of) 
(3.103) 

= m((K UQY f )+m((K 1 UQ)*) 

-m((ff UQ)*or(A-iUQ)*). 

Since (K l)Q)* and (i^UO)* intersect, a line hits (tf UOf or (KiUQ)* if 
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(K UO)* 



(*iUQ)* 



Fig. 3.12 



and only if it hits the convex hull of (K UQ)*\J(KiUQ)* which is just 
(/foUATi)*, the convex hull of AoU/Ci. Thus 

(3.104) m(K andK l ) = m((K UOf) + m((K l UOT)-m((K UKif) 

and we have the following. 

Let K and Ki be disjoint, connected sets on the plane. A random line which 
intersects K also intersects K l with probability 

E(r * o )+£(r 1 * o )-E(r *i) 



(3.105) 



P[Iine hits AT! | hits #(,] 



£(r ) 

where E(T {) ) is the mean thickness of K and E(r* ), E(T* Q ), and E(T*i) are 
the mean thicknesses of the convex hulls of K \-)Q, KiUO and /f UJfi, 
respectively, where Q is the intersection of two lines just touching the edges of 
Kq and Ki and crossing between them. 
For an isotropic random line 



(3.106) 



P[line hits #i| hits /C ] = 



Lqq + Liq- 



l * 



r * 



where L*q, L*q, L*\ and L* are the perimeters of the convex hulls of K (JQ, 
Ki U Q, K \JKi and K , respectively. 

Consider the mean length of intersection of a random line with a set. Suppose 
a random line intersects a measurable set K on the plane. The mean value of the 
length C of line within K is 



£(C) = 



(3.107) 



\\CdpdF{d) 
m(K) 

\\CdpdF{6) 
E(T) ■ 



The integral \ C dp is just the area A of K. (This fact is easy to see by breaking K 
into thin parallel strips as in Fig. 3.13 and adding up the areas of the strips.) 
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_ Total lenglhC 




Fig. 3.13 



Hence 
(3.108) 



E(C)- 



| A dF(6) A 



E(T) E(T) 
If AT is a connected region and the random line is isotropically distributed, 

7TA 



(3.109) 



E(C)- 



where L* is the perimeter of the convex hull of E. In particular, when K is 
convex with perimeter L we have 



(3.110) 



BiC^f, 



one of the results of Crofton. 

Now consider random lines intersecting within a region. We shall now find the 
probability that the two independent random lines, both intersecting a region K, 
intersect within K. Suppose the line segments formed within K by the first line 
have total length C x and perpendicular direction 9 X . A second random line 
hitting K intersects these line segments with probability 



(3.111) 



P[Intersect in K \ C u 6 y ] = - 



E 2 (T) 
where Ti(02) is the thickness of the intersection of the first line with K, and the 
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Tola I knglh 

C\lCOs(S;-«, + ^)l 

= r,lsin<e ; ~0i)l 



total length 
C, within K 




X .1X1*. 



Fig. 3.14 



notation E 2 indicates expectation with respect to the distribution F 2 (6) of the 
orientation of the second line. From Fig. 3.14 we see that 



(3.112) 
so that 

(3.113) 



E 2 (r 1 )=C 1 E 2 (!sin(6»2-6» 1 )j) 

CjEzflsin (02-001) 



/"[Intersect in AT | G,0i] = 



E 2 (T ) 

Integrating over all possible positions of the first line, we get 

E[C l E 2 (\sin (02-001)] 



(3.114) P[Intersectin AT] = 

The numerator is 



E[C,E 2 [\sm (02-0i 



)|]]=f fc. 



E 2 (T) 



|sin (02-001 ^F 2 (0 2 ) dp, dFtfJ 



(3.115) _ J; [j Ci rf Pl ] fo" |sin (02- 0Q1 df 2 (0 2 ) rfFi(fli) 

E,(F) 

= A £g |sin (02 - 0Q1 dF 2 (6 2 ) ^(00 
Ei(T) ~ ' 

Thus we may write two independent random lines whose perpendicular direc- 
tions have the distribution functions F\ and F 2 and intersect a region K. They 
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intersect each other within K with probability 

A \;\; |sin (0! - 2 )| <*Fi(0i) dF 2 (0 2 ) 



(3.116) P[IntersectinAT] = : 



E 1 (T)E 2 (T) 



where E\{T) and E 2 (T) are the mean thicknesses of AT with respect to the 
distributions F x and F 2 , respectively. 
Since 



(3.117) - 

TT J 



2 
|sin(0i-0 2 )|d0i=-, 

TT 



the factor I^j^ |sin(0i-0 2 )| dFi(9i)dF 2 (d 2 ) is just 2/7T if either of the lines is 
isotropically distributed. 

If one of the lines is isotropically distributed, say the one corresponding to 
subscript 1, 

24 
(3.118) P[Intersect in K] 



7 rF 1 (r)F 2 (r)- 

If, in addition, K is a connected region, 

2A 



(3.119) P[Intersect in K] = 



L*E 2 (T) 



where L* is the perimeter of the convex hull of K. If both lines are isotropic and 
K is connected, 

(3.120) P[Intersect in K] = -j*t 

which, when K is convex with perimeter L, reduces to 

2ttA 

(3.121) P[IntersectinA"]=— — , 

another of Crofton's well-known results. 

We now examine angles between intersecting random lines. For any two 
random lines with perpendicular directions 0i and 2 , the angles between the 
lines are |0i-0 2 | and 77--|0i-0 2 |, whose probability distribution can be cal- 
culated from 



(3.122) P[\e 1 -0 2 \^a] = 



dF 1 (6 l )dF 2 (0 2 ). 

|«,-S 2 ISa 



Thus, for example, if either line is isotropic, the angle between the lines is 
uniformly distributed from to it. 

On the other hand, suppose we pick two random lines which are known to 
intersect within some finite region K of the type considered before. The dis- 
tribution of the angle between these lines is not the same as the distribution of 
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the angle formed between two particular random lines. Let I\ 2 denote the event 
that the two random lines intersect within K. Then 



P[|0i-0 2 |<a|/i2] = 



p[\ei-e 2 \<a\i 12 ] 



(3.123) 



P[hi\ 
_ JU- fc i<a P[In 1 6u e 2 \ dF 1 (6 1 ) dF 2 {6) 

P[h 2 ] 

From our previous result, we have 



(3.124) 
and 
(3.125) 
so that 



P[/l2|6»l,02] : 



^4|sin (^i — <? 2 ) 



PUn] 



£ 1 (T)E 2 (T) 

A K£ [sin (fl, - 6 2 )\ dF 1 (d l )dF 2 (6 2 ) 
E,(T)E 2 {T) 



,3,26, ni»,-«.i<.i^- tey;^ ( ^y - 

J J |sin (0, - 2 )| dFi{6i) dF 2 (6 2 ) 
If either of the lines is isotropically distributed, say, the second 



(3.127) 
and 



77 J 



|sin(0i-0 2 )|<*02 = - 







- f \sm(6 l -0 2 )\dF 1 (di)d6 2 = - [| |sin <p\ dcp dF x {6i) 

77 J J 7T J J 

ia,-e 2 l<i l»l<a 



(3.128) 

Thus for ^ a ^ 7T 
(3.129) 



1 f l • l 
= — [sin (p\ 

77 J|,p|<a 



d(p. 



P[\e l -e 2 \<a\I 12 ]=- sin <p d<p. 

2- •>\<D]<a 



Therefore if two random lines intersect within some finite region in the plane, 
the angle q> formed at their intersection has the probability density 2 sin <p for 
^ tp S 77, if either of the intersecting lines is isotropic. 

Now consider an infinite collection of random lines, independently and iden- 
tically distributed over the plane. Define the intensity t of the collection of 
random lines to be the mean number of lines intersecting a circle of unit 
diameter. We shall find the probability distribution of the number of lines 
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intersecting a set K made up of the areas within a finite number of closed curves 
on the plane. 

Let C\, C 2 , • • • be concentric circles, the circle Q having diameter /. Let Nj 
denote the number of random lines intersecting Q, and define 

(3.130) Xj=~Ni-N H u 

that is, Xj is the number of random lines hitting C ; but not Q-i. Since the 
random lines are independently distributed, the random variables X\, X 2 , • • ■ 
are independent. Since the mean thickness E(T) of a circle is its diameter, a 
random line which hits C n also intersects C, within C n (/ = «) with probability 
j/n. Thus the probability that a line hitting C„ hits Q but not Q-i is 

(3.131) i._izl = I 

n n n 

and 

/N\/l\ k / l\ N » _fc 

(3.132) P[X, - = k\N n lines hit C n ] =( "J^-J (l--J . 

Since this does not depend on /', the unconditional probability P[Xj = k] does 
not depend on ;'. In other words, the variables X\,X 2 , ■ ■ • ,X„ are identically 
distributed. Since the Xj are independent and identically distributed, the law of 
large numbers yields 

(3.133) -iXj^T asn-»oo, 

n y = i 

that is, 

(3.134) >t as«-*oo 

n 

p 
where the notation -» indicates convergence in probability. 

Now consider any set K made up of the areas within a finite number of closed 

curves on the plane. Let K be contained in a large circle C„ of diameter n. 

A random line which hits C n also hits K with probability E(T)/n. Thus the 

probability distribution of Nk, the number of lines intersecting K, given N„, the 

number hitting C„, is 



P[N k = k\N n ] = ( N k n ) 



E(T) 



i n 



1 E(T)y*-" 

n J 



(3.135) 



1 N n (N„-l) (N-k + 1) 
k\ n n n 

,H E(T)\ n V N :~ k V" 

. [E (T)r[(i-yp)] . 
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p 
Since P[N k -k\N„] is a continuous function of NJn, the fact that Njn -» r as 

n ^oo implies 



(3.136) P[N K = k\N n ]^^[E(T)] k e~ rE(T) . 
Since /'[Nn = A: | N n ] is bounded, the above convergence implies 

(3.137) E(P[N k = k\N H ])-*- , (1>i 

k\ 

as n h»oo. This, in turn, implies 



(3.138) p[jv k =*]=- 



Jfc! 



and so N k has a Poisson distribution with mean tE(T), where E(T) is the mean 
thickness of the set K and r is the intensity of the random lines. 

Moreover the intersections of random lines with a line having direction <p are 
the events of a Poisson process with mean rate tE(\cos (8 — cp)\). In particular, 
the distance x from any point on the line to the first intersection by a random line 
has the probability density t£(|cos (tf -«p)|) c~ tE(|cos( "~' pJI> * for x>0. This result 
has already been developed in the highway model. 

For the isotropic case, we have the following simplifications. The number of 
isotropic random lines intersecting a region K within some closed curve has a 
Poisson distribution with mean tL*/tt. Furthermore, the intersections of the 
isotropic random lines with any particular line are events of a Poisson process 
with mean rate 2t/tt. Thus the distance x along the line from an arbitrary point 
to the first intersection has the density {2T/ir)e~ 2Tx/ " for xSO. This was also 
derived previously. 

Now consider the points of intersection of the random lines in a Poisson field 
of random lines on the plane. Due to the translation-invariance property, the 
points of intersection are uniformly distributed over the plane. The mean num- 
ber of these points of intersection within any region K can be found as follows. 

Suppose N K lines intersect the region K. Let M K denote the number of 
intersections within K of the N K "chords" formed by the lines intersecting K. 
(Since K may be nonconvex, each "chord" may consist of several colinear line 
segments.) Define 

1 if the /th and Mi "chords" intersect in K, 
otherwise. 

Then M K can be written 



N K N K 

(3.139) M K =l I a ik 
and 

(3.140) E(M K \NK) = ^N K (N K ~[)E(a !k ). 
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Since, by the previous result, N K has a Poisson distribution with mean tE(T), 

(3.141) 



E(M K )= l -[ T E(T)fE(a ik ). 



Moreover the probability that two lines intersect within K, given that they hit 
AT, is 



(3.142) 



E{a jk )- 



A \;\; |sin (fr - e 2 )\ dF(6 1 ) dF(8 2 ) 
[E{T)f 



Combining these expressions for E(M K ) and E(a jk ), we get the points of 
intersection of the lines of a Poisson field of random lines on the plane are 
uniformly distributed over the plane. The mean number of such intersections 
within a region K with area A is 



(3.143) E{M k ) = \t 2 a\ f |sin(0,- 9 2 )\dF{6 l ) dF(d : 

I Jo Jo 



) 



where r is the intensity of the Poisson field (i.e., the mean number of random 
lines intersecting a circle of unit diameter). 

The mean number of points of intersection of isotropic random lines within 
K is 



(3.144) 



E(M K ) = 



r 2 A 



The mean number of intersections outside of K among lines intersecting K 
can be obtained. The total number of intersections among the N K chords hitting 
K is \N K {N K - 1). Thus the number of intersections outside of K is 



(3.145) 

and 

(3.146) 



E(M K | N K ) = ^N K (N K - 1)[P[0, 1*0,]- E( aii )] 



E{M K ) = -[rE{T)f[P[e i * 0,]-£K)], 



or 



E{M K ) = ^r 2 



[EiTrfPiOi *6j\-A\ \ |sin (ft - 0,)| dF(0,) dF(ft)] 
Jo Jo J 

where P[0, ^ 8f[ is the probability that two random lines are not parallel. 



CHAPTER 4 



Covering a Circle Circumference and a 
Sphere Surface 



In his turn of the century book Whitworth (1897) was one of the first to 
consider problems relating to the coverage of a circle by random arcs. Suppose 
we place n arcs of size a uniformly and independently on the circumference of a 
circle. Then Whitworth's answer to exercise 667 in his volume can be interpreted 
as the probability that the part of the circumference that is not covered by an arc 
consists of exactly n—r connected components, that is, the pattern of arcs leaves 
n—r gaps on the circumference. He restricted himself, by the comment in 
exercise 666 "say c—na = ma" where c is the size of the circumference of the 
circle, to the case in which na <c. 

This restriction was apparently overlooked by F. Garwood (1940) who incor- 
rectly credits Whitworth with the evaluation of the probability of complete 
coverage of the circle by these arcs. Complete coverage corresponds to the case 
r ~ n. However, the restriction na<c implies that there are not enough arcs to 
cover the circle completely. Thus Whitworth's result excludes the case of 
interest, namely probability of coverage when it is possible to cover. Indeed, 
Whitworth's formula with r = n sums to zero identically for any value of a, as can 
be seen using elementary theory of finite differences, and thus cannot be correct 
when na>c. Darling (1953) and Shepp (1972), apparently because of Gar- 
wood's remarks, also mistakenly attribute this result to Whitworth. 

M. Baticle (1935) comes very close to evaluating this coverage probability 
while considering a problem of partitioning an interval, but he does not state 
when one should stop summing his series. To be correct, we must stop at the 
[c/a] + 1 term, where brackets indicate the greatest integer less than or equal to 
c/a. If one sums completely, as he indicated, one gets zero, as was the case for 
Whitworth's formula. 

W. L. Stevens (1939) thus appears to be the first to evaluate the probability 
that n random arcs of size a cover a unit circumference completely and he 
provides an ingenious solution. This probability, for a circle of circumference 
one, is 

(4.i) i-^i-ar'+Qa-iar 1 -- ■ . +( -i) k (£)(i- ^r 1 

where k = [1/a], the greatest integer contained in \ja. By using a geometrical 
argument, Stevens also solves generally for the probability that the arcs will 
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leave exactly / uncovered "gaps" between them. This is shown to be 

(«) (;)!(-,)"(;;,')(.-«-. 

There is a series much like (4.1) derived by R. A. Fisher (1929) as the null 
distribution of a test statistic for significance in time series analysis. Fisher (1940) 
noted the similarity between his series and Stevens' result and the link between 
the two problems. 

We now develop the Stevens solution and then add a number of additional 
results due to Siegel (1977). 

Random arcs on the circumference of a circle. Consider a circle of unit 
circumference. Let us drop n arcs each of length a, < a < 1, at random onto the 
circle with circumference equal to 1. We desire the probability that every point 
of the circle is covered by at least one of the arcs. 

We can identify each arc by its clockwise endpoint, and we can assume these n 
points (corresponding to the n arcs) are uniformly distributed over the circum- 
ference of the circle. Without loss of generality we can rotate the circle so that 
the clockwise endpoint of one arc falls in the twelve o'clock position on the 
circle. This arc is labeled 1 and the remaining arcs are labeled in the counter- 
clockwise direction from 2 to n. (See Fig. 4.1.) 




Fig. 4.1 

First we shall find the probability /(l) that there is a gap or uncovered portion 
of the circle after the rth arc, r specified. Let G r denote the event that there is a 
gap after the rth arc. Let F a denote the event that no clockwise endpoint falls in 
the portion of circle of length a extending in the clockwise direction from point 
1. Then there is a one to one correspondence between the configurations of arcs 
where G, occurs and the configurations of arcs where F a occurs. This cor- 
respondence can be seen by considering the operator T r which rotates arcs 
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r+l, r + 2, ■ ■ ■ ,n clockwise through a distance a. For any configuration C of 
arcs where G r occurs, T r (C) is a valid configuration of arcs (i.e., the ordering of 
the arcs is preserved) where F a occurs. On the other hand, if C is any configura- 
tion of arcs where F a occurs, then T r (C) is a valid configuration of arcs where 
G r (i.e., a gap after the rth arc) occurs. The effect of the operator T r is illustrated 
in Fig. 4.2. This one to one correspondence implies that 

P[G r ] = P[F a ]. 

But P[G r ] =/(l), namely the probability that there is a gap or uncovered portion 
of the circle after the rth arc, and P[F a ] = (1 — a) n ~ . Thus we have 



(4.3) 



/(!) = (!-«)" 




Configuration C 



6 7 
Configuration T 4 (C) 




Configuration C 



6 7 
Configuration T 2 (C) 

Fig. 4.3 



Configuration T 4 T 2 (C) 



In the same way we can find the probability /(2) that there is a gap directly 
after arc r x and directly after arc r 2 . Let G r ,. r2 denote this event. Then the 
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operator T n T n provides a one to one correspondence between configurations 
where G ri , r2 occurs and configurations where F 2a occurs. This correspondence is 
illustrated in Fig. 4.3. Thus 

P[G ri , r2 ] = P[F 2a ]. 

But P[G rur2 ] =/(2), namely the probability that there is a gap after the rith arc 
and a gap after the r 2 th arc and P[F 2a ] = (\-2a) n ~ l , so we have 

(4.4) f(2) = (l-2a)"-\ 

It is clear that we can use this same procedure to obtain the probability that 
there are gaps in i specified places. The distance between each pair of arcs 
enclosing a gap may be reduced by a (by shifting the arcs labeled by higher 
numbers a distance a clockwise), to produce a configuration with the portion of 
length ia terminating at 1 containing no clockwise endpoints of arcs, and 
conversely. Hence, if we let /(/') denote the probability of gaps after / specified 
arcs (regardless of what happens elsewhere) then 

(4.5) /»)-{< 1 - tar ' !"' 

[0, i>k, 

where k is the greatest integer less than I /a. 

To proceed with the solution we develop a theorem in probability concerning 
a random n -tuple whose components have two possible outcomes H or T. The 
components are not independent, but the probability that any h specified com- 
ponents out of the n are H, whatever the remainder, is known to be f(h). We 
wish to develop formulae for the probability f(h, t) that h specified components 
are H, t specified components are T and the remaining n-h-t are either H 
or T. 

First we shall evaluate f(h, 1). Let ;' be the component which is specified to be 
a T. Consider the set of all outcomes of the random n -tuple in which h specified 
components are H's. This set has probability f(h). Delete from this set all 
outcomes in which there is a T in component /, i.e., all outcomes with h specified 
H's and one specified T. The deleted set has probability f(h, 1). The resulting set 
has H's not only in the original h specified components but also in component /. 
The probability of the set that results is f(h + 1). Thus we have 

(4.6a) f(h)-f(h,l) = f(h + l) 

or 

(4.6b) f(h, l) = /(A)-/(A + l)=-A/(/i), 

where A denotes the difference operator. 

With a similar argument we can obtain the probability f{h, 2). Let / and m be 
the components that are to be T's. Consider the set of all outcomes in which h 
specified components are //'s and the component / is a T. This set has proba- 
bility f{h, 1). Delete from this set all outcomes with a Tin component m. The 
deleted set has probability f(h, 2). The members of the resulting set have H's in 
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the h specified places as well as in component m and a Tin component /. Its 
probability is f(h + \, 1). Thus 

f(h,l)-f(h,2) = f(h + l,l) 
or 

f(h,2) = f(h, l)-f(h + l, 1) = -A/(A, l) = A 2 /(/t). 

This indicates that, in general, for tf^n — h 

(4.7) f(h,t) = (-*)fth) = f(h)-tf(h + l)+ t -!^f(h + 2)---'(-l)'f(h + t). 

This expression can be proved using induction and the same argument as above. 
Assume it is true for all h and a particular t, any r . Consider the set of all 
outcomes in which h specified components are H's and t specified components 
are 7"s. This set has probability f(h, t Q ). Let / be any component whose value is 
not specified. Delete from the above set all outcomes with a T in component ;'. 
The deleted set has probability f(h, t + 1). Each member of the resulting set has 
H's in the h specified places as well as in component / and T's in the t specified 
places for T"s. This set has probability f(h + 1, t ). Thus 

(4.8a) f(h, t )-f(h, t +l) = f(h + 1, t ) 

or 

(4.8b) f(h, t +l) = f(h, t )-f(h + 1, t )= -A/(/t, r )= (-A)'° +l f(h) 

and the proof by induction is complete. This can be done also without the use of 
operators by substituting in the series expansion given above for f(h, t). 

In particular the probability that h specified events are n, and the remainder 
all T, is 

/(M-/0=(-Ar7(A) 
(4.9) =f{h)-(n-h)f{h + l) 

+ (w ~* x :;~*" 1 W )---(-irY(»). 

An easy way to evaluate this expression is to represent the difference operator 
A as 

where U is a unit shift operator; i.e., 

Uf(h) = f(h + 1) and U'f(h) = f(h+j). 
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Then 

f(h,n-h) = (-±) n - h f(h) 

= (l-U)"- h f(h) 
(4.10) "->> in-h 



i (":")(- vim 

n ~ h in~h\ 
= I . )(-l)'/(/i+7). 



Up to this point we have considered only probabilities in which the exact 
positions of the H's and 7"s are specified. The probability of h H's and (n-h) 

T's, regardless of their positions, is obtained by noting that there are ( I ways 

of selecting h components from the n -tuple. Thus the probability of exactly h 
tf'sis 

We can apply this result to obtain the solution to our problem of random arcs 
covering the circumference of a circle. Let the alternative H and T in the /th 
component of the n -tuple refer to the existence or nonexistence, respectively, of 
a gap after the /th arc. We have already evaluated the probabilities 

Ul-iar\ i^k, 

/(0= lo, />*. 

of gaps in / specified places. Thus we have: 

P[Nogaps]=£ ("V-iya-Ao- 1 

;=-o \]J 

(4.12) \"-i,n(n-l) „_, 

= l-«(l-a) + (l-2a) -•■• 

n(n-i)- ■ ■ (n-fr + 1) , 
ifc! (1 "* fl) ' 

(4.13) P[Onegap] = n' t I 1 ( n T 1 )(-iy(l-[/+l]ar- 1 , 

(4.i4) p[igaps] = (") k f ( n 7')(-iy'(i-[y+/>r 1 , 

(4.15) P[A:gaps] = (^)(l-fca)"- 1 

where k is the greatest integer less than I /a. 
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Thus the probability of covering the circumference with n arcs can be com- 
puted from P[Nogaps]=/(0). For example, if a=i /(0)=.4929 for n = 16, 
/(0)= .5596 for n = 17. Thus, we require 17 arcs to be at least 50% sure that the 
circumference will be covered. 

C. Domb (1947) considered a generalization of Stevens' problem. Intervals of 
length a are placed on R the real line, with left endpoints as the events of a 
Poisson process. The moments and the distribution of the proportion of the 
interval [0, y] covered by such segments is calculated, together with the proba- 
bility of complete coverage, by the use of a Laplace transform argument on a 
system of integral equations. These results can be transformed to give the 
corresponding results for the problem of N random arcs of size a on the 
circumference on a circle, when N has a Poisson distribution. By expanding his 
coverage probability result in powers of A, the intensity of the Poisson process, 
Domb is able to produce Stevens' formula for coverage by n arcs, with n fixed. 
In principle, a similar expansion would yield the moments and the distribution of 
that proportion of the circle covered by a fixed number n of random arcs. 
However, these are not produced by Domb because for the distribution, "the 
complete solution ... is very complicated" and in the case of the higher 
moments, because "they become rather cumbersome." Both the moments and 
the distribution are produced explicitly, by a method different from Domb's, in 
Siegel (1977), which we produce shortly. 

J. G. Mauldon (1951) generalized Stevens' result in a different way. If we let 
Vi, • ■ ■ , Y„ be the gaps between n points placed uniformly and independently 
on the circle, then P(maxi Sl - s „ YjS=a) is the probability that n random arcs of 
size a cover the circle. This is easily seen if we place the clockwise endpoint of 
one arc at each Y,. Thus Stevens' result is seen to yield the distribution of the 
largest Y,. Mauldon derived the moments and the distribution of the sum of the 
k largest of Y\, ■ ■ ■ , Y n . Thus the case k = 1 reduces to the circle coverage 
probability (1). 

Let Yi, ■ ■ • , Y n be gaps on the circle as in the previous paragraph. D. A. 
Darling (1953) found a complex contour integral representation of the charac- 
teristic function of the random variable W n =£" =1 hj(Yj) for quite arbitrary 
functions h h His method gives a unified treatment to problems arising in the 
study of infectious diseases, traffic flow, etc., as well as some problems in 
nonparametric tests for goodness of fit. Among results derived by the use of his 
method is Stevens' formula for the circle coverage probability. 

If we place arcs of size a sequentially and uniformly on the circumference of a 
circle, Stevens' result yields the probability distribution of N, the random num- 
ber of arcs at which complete coverage first occurs. This is because the proba- 
bility of coverage by n arcs (where n is fixed) is P(N ^n). L. Flatto and A. G. 
Konheim (1962) considered the problem asymptotically for a tending to zero (so 
that N increases stochastically without limit) and showed that 



fl/o] 



\*-l 



(4.16) E g (AT)=l- I (~1)* (1 **li a-'logfl-' as«\,0. 

k=i (ka) 
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About ten years later, P. J. Cooke (1972) suggested the asymptotic form of the 
first two moments of N. 

A. Dvoretsky (1956) considered coverage of the circle by an infinite number 
of random arcs of specified lengths but uniform independent placement. He 
showed that if the length of the rth arc is /, e (0, 1), then the divergence of the 
sum £, = 1 /, is necessary and sufficient for the arcs to cover almost all of the circle 
with probability one. He left open the corresponding problem of determining 
necessary and sufficient conditions on the sequence of lengths so as to cover all 
(not just "almost all") of the circle with probability one by exhibiting a sequence 
of lengths for which Y.7=\ U diverged, but which did not cover all of the circle with 
probability one. 

P. Billard (1965) gave a partial solution to Dvoretsky's problem by 
exhibiting a necessary condition and a (different) sufficient condition. M. 
Mandelbrot (1972) produced another partial solution and established for the 
first time that arcs of lengths /„ = l/(« + 1) will cover all of the circle with proba- 
bility one. 

L. A. Shepp (1972) finally provided the complete answer to Dvoretsky's 
problem. He showed that if the length of the rth arc is /, e (0, 1), then 

(4.17) £ e *PC?-i''> ,ao 

n = l n 

is a necessary and sufficient condition for these arcs, randomly placed, to cover 
all of the circle with probability one. Shepp also derived bounds on the proba- 
bility of coverage of a circle by n arcs of size a, and used these bounds to study 
the asymptotic distribution of the number just needed to cover. 

In this section we reconsider the problem of n arcs X\,---,X„ of length a 
placed independently at random on the circumference of a circle and present the 
development by A. F. Siegel (1977). Assume that the circumference has length 
one and that the centers of the arcs (or equivalently their clockwise endpoints) 
are uniformly distributed over the circle. We define the coverage to be 



(4.18) C (n . a) = M U X, 



i = l 



and the vacancy to be 

(4.19) D (nM) =\-C (nMh 

where fi denotes Lebesgue measure on the circumference. Thus C ( „. a ) and D (nM) 
are random variables representing the proportion of the circumference covered 
(respectively not covered) by the arcs X\, ■ ■ ■ , X n . Siegel derives formulae for 
the moments of coverage (i.e., the moments of C (n , a) ), the distribution of 
coverage, and its limiting distribution as n tends to infinity. 

Moments of coverage. The moments of coverage are found by deriving a 
recursive integral equation for the moments of D (n , a y Robbins' theorem (1944) 
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is the starting point, yielding a formula for the wth moment of D^ a) : 



(4.20) ED ( -.«) = 



Pa\ Z\, ■ ■ ' , Z m € 



U X-) dzu 

; = i / 



,dz„ 



where K denotes the circumference of our circle and the subscript a of P is the 
arc length. From the fact that X\, ■ • ■ ,X„ are independent and identically 
distributed sets, we have 



(4.21) EDr n .a)=\ [P a (z u ---,z„eXfi] n dz 1 , 



dz„ 



Using invariance of the integrand under permutations of the z, to order them by 
their magnitudes, 0Sz ( i)g- • ■Sz (m) <l, together with the fact that 



(4.22) 



Pa (Z 1 , ' ' ' , Z m £ X\ ) 



= Cz(2)-Z(i)-a)+ + - • • + (Z( m )-Z( m -i)-a) + + (l + z (1) -z (m) -a)+ 
we can derive the recursive integral equation. For the case a < 1/2, this is 

(4.23) +m I I") f " x m+k - l (\-x-ar k EDr k , a/x) dx 



+ m x m+n ^ED^, a/x) dx; 



whereas, if a g 1/2, then the equation is simpler: 



(4.24) ED,™:} = (*" + n )~\\ -a) m + n +m 






ED(„, a / X )dx. 



These may be solved. It can be shown by induction that the solution to (4.23) 
and (4.24) is 

(4.25) eo^. (* + » - I)" 1 £ (7)(^; 1 1 )(1 - la)V-\ 

where (1 — la)+ is equal to zero if (1 — la) is negative. The moments of coverage 
are then obtainable as 

(4.26) £C ( " a) = E(l-A».<.)r = l+ I {-lf^T)EDl, a) . 
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It may be instructive to write the first four moments of D na explicitly. 
/xi = (l-a)", 

n+l n+1 

6 ri xn+2 , 6(n~l) <,„\n + 2 

(n + l)(n+2) (n + l)(/i 4-2) 

(4.27) 

, (w-lKw-2) ^ +2 

(n + l)(n + 2) 

= 24 +3 36(«-l) +3 

^ 4 (« + l)(n+2)(n+3)^ "' (n + l)(n +2)(n +3) K "' 

12(«-lX»-2) n „ +3 («-!)(/* -2)(k- 3) , n+3 

+ (« + l)(«+2)(n + 3) (1 3a)+ + (« + l)(n+2)(n+3) (1 * a)+ ' 



Distribution of coverage. From the moments of coverage, the distribution of 
coverage P(C( n ,a) = is, in theory, completely determined because a distribution 
on a finite interval is uniquely characterized by its moments. By considering 
special cases, one may make an educated guess at the coverage distribution. 
Because we have the moments of coverage, this guess may be verified. By the 
use of this method, the cumulative distribution of coverage can be shown to be 

F n , a (t) = P(C(n.a)^t) 

<« 8 > -i?<-ri^('; 1 )(- t - , )(i-^-.):-- 

where we use the convention that 

, , , fl. 1-la-t^O, 

< 4 - 29 > ('-'"Ho, i-/«-,<o. 

The coverage, C (n , a ), is a continuous random variable except for a point mass 
arising under one of two possible situations. First, if na > 1, then n random arcs 
of length a will have positive probability of covering the circle. In this case, C ( „ ia) 
has positive mass at 1 . Second, if na < 1 , then n arcs of length a will have 
positive probability of not overlapping. In this case, C(„, a ) has positive mass 
at na. 

In fact, we can decompose Q„, a ) into a mixture of a continuous and a 
degenerate part: 

(4.30) C ( „, fl) = (^ (n ' a) ' probability P ' 



COVERING A CIRCLE CIRCUMFERENCE AND A SPHERE SURFACE 85 

where 

\l (-l)'(")(l-'«)r\ if nasi, 

(4.3i) P = <=° ;'/ 

l(l-na) , ifna<l, 

A ( „, a ) is degenerate at l-(l-rca)+ because 

1, «£>!, 



(4.32) ^< n ,«o=', ^, 

\na, na<l, 



and B( B .o) has density 

£ J, , ^;+ir/«-l\/«-l\//-r 

-p 

and the functions F„ >a (r) and f„, a (t) are piecewise polynomials. 



(«-M)/-w- T £rfi 1 (-ir(^: 1 1 )("- 1 )(i: , I )(i-.r<.-w-'-'. 



Asymptotic distribution of coverage. If we hold arc length a constant and let n 
become large, the coverage probability tends to 1 . If we condition on the event 
that the circle is not covered, we can study the conditional limiting distribution of 
coverage. This can be shown by the method of moments to be asymptotically 
related to the exponential distribution. 

Using the decomposition (4.30), we see that the conditional coverage is 

(4.34) B(n,a)= C(„. )| {circle not covered}. 
From the moment formula (4.25) it can be shown that 

(4.35) lim E[n(l-B (n , a) )] m = (1 -a) m m\ . 

n-»oo 

Thus the limiting distribution of n(l -S ( „, a) )/(1 — a) is exponential. This is to say 
that 

(4.36) \imP( n(l ; B(n - a)) St) = 1-V. 
From this, it can be shown that 

(4.37) /»( " (1 "_^"- a)) >f)~n(l-ar 1 e" 

for each f, as n tends to infinity, in the sense that the limiting ratio of the two 
sides of (4.37) is 1. 

Random caps on a sphere. The problem of random arcs covering a circle can 
be generalized to three dimensions, where the situation becomes one of covering 
the surface of a sphere with randomly placed circular caps. Consider a sphere of 
unit radius on which are placed N circular spherical caps subtending a half angle 
a at the center of the sphere (see Fig. 4.4). Assume the centers of these caps are 
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Fig. 4.4 

independently and uniformly distributed over the surface of the sphere. We seek 
the probability P(N) that the sphere is completely covered. 

This problem arises in practice in the study of the attachment of antibodies to 
influenza virus particles. Antibodies are supposed to be cigar-shaped molecules 
which attach themselves at their ends to the virus particle. The influenza virus 
particle can be considered to be a sphere which infects a cell by coming in 
contact with it. If enough antibodies are attached to its surface, the virus cannot 
touch another cell; i.e., it is no longer infectious. In particular, since the cell 
which is susceptible to virus infection is much larger than the virus, its surface 
may be assumed fiat (see Fig. 4.5). An attached antibody thus prevents a 
spherical cap S on the virus from touching and thus infecting the cell. The virus is 
no longer infectious when its surface is completely covered by the spherical caps 
centered at the attached antibodies. 

The exact probability that a sphere is covered by N circular caps is not known. 
However, an asymptotic value for large N was derived by Moran and Fazekas de 
St. Groth (1962). Their method can be most easily demonstrated by applying it 
to the two dimensional problem of random arcs on the circle. Let Y be the 



Cell 



/,//,///////////////// 




Antibody 



Virus 



Fig. 4.5 
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measure of the set of points on the circumference not covered by the N arcs. 
Then the distribution of Y is clearly continuous on [0, 1] with a concentration of 
probability P(N) at Y = 0. 

We obtain the first two moments of Y in the following manner. Let the 
random variable xn(€) be the indicator function of the event that the point { is 
not covered by N" random arcs; i.e., 

(1 if £ is not covered by N arcs, 
(4 " 38) *~<Ho otherwise. 



Then 

(4.39) Y 

and 



i 



i 



E[xM)]dl 



(4.40) E[Y] = 
Clearly E[x\(€ )] -(1-a). Since the arcs are independent 

(4.41) E[xn(€)] = (Elxmf = (1 -*)"■ 
Thus we have 

(4.42) E[Y] = (l-af. 



Similarly 



(4.43) F 2 =f XN (£)d€ 

and 



XN(o)da> 



(4.44) E[Y 2 ]=\ f E[x N (£)x N (a>)]d€da,. 

Jo Jo 

The expression E[xn(€)xn(<o)] is just the probability that both £ and w are 
not covered. If the arc length from $ to co is greater than a, then £ and w are not 
covered if and only if no clockwise endpoints of random arcs fall on the two 
sections of length a shown in Fig. 4.6a. 

If the arc length from £ to w is less than a, say /, then £ and o) are not covered 
if and only if no clockwise endpoints of random arcs fall on the section of length 
t + a shown in Fig. 4.6b. Thus 

E[xn(£)xn(<o)] = P[l o> not covered] 

(4.45) f(l-2«)" if||f-<w||is fl , 

\(l-(t + a)f iiU-a>\\ = t<a. 
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Fig. 4.6a Fig. 4.6b 

(Here ||f -«|| denotes the positive arc length from £ to w.) Hence we have 



E[Y 2 ]= [ (l-2a) N d£dw + 



0S£, oiSl 



0S«,»S1 



[l-(t + a)] N d£dco 



||«-<4=(Sa 
1 . 1 



E[Y 2 }=\ (l-2afd£da> + 

/• 1 t> at— a iii * i 

(4.46) = (l-2ar^da>+ (l-2a) N d£d« 

« 1 » ti> » 1 - (o+a 

+ [l-(a>-£ + a)fd£da>+\ [1 - (£ - w + a)] N d£ da> 

JO J<u-a JO J<o 

= f (l-2a)%>-a)<fo>+[ (l-2a) N (l-<o-a)do> 
Jo Jo 

For large JV, the set of points which are not covered is with high probability a 
single interval if it is not empty. Furthermore, the length Y of this interval has a 
negative exponential distribution. (It may be thought of as the waiting time to 
the first endpoint of the series of overlapping arcs.) Let fii denote the first 
moment about the origin of this negative exponential distribution. Then the 
second moment about the origin is 2/u. 2 and we have 

(4.47) E(Y)=[l-P(N))fiu 

(4.48) E(y 2 )=2[l-P(A0W. 
Thus 



(4.49) 



2*2 (Y ) 
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Hence the probability that the circle is completely covered is 

(4.50) P(N)~l-N(l-af~\ 

Note that this is precisely what we would have arrived at using our previous 
result 



P[Nogaps]=I ( .)(-iy(i-/ a )"-\ 



For large N, the terms (1 -ja) N l for /> 1 will be negligible compared with the 
term (1 — a) ~ and we get 

(4.51) P[Nogaps]~l-N(l-a) N ~' 
which is the same as that obtained above. 

Application of method of moments. We shall now apply this same method to 
obtain the probability that the surface of the sphere is completely covered by the 
spherical caps. Since the sphere has unit radius, its surface area is Ait. Each 
spherical cap of angular radius a will have an area of 2*7r(l-cosa). Thus the 
probability that any specified point on the sphere will be covered by a randomly 
placed cap is 

(4.52) A =-(l-cosa) = sin 2 ^. 

A ring of width dQ> and angular radius <$> (i.e., the ring subtends an angle 2$ at 
the center of the sphere) on the surface of a sphere of unit radius has area 
277 sin $ d® (see Fig. 4.7). 
Thus the area of a circular cap of angular radius a is 



(4.53) 



2-tt sin4> d<J> = 27r(l-cos a). 



o 



Then the probability that N random caps do not cover some particular point Pi 
is (1 - A ) N . Define the random variable x N (Pi) as above; that is, let x N (Pi) be the 




Fig. 4.7 
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indicator function of the event that the point Pi is not covered by N random 
caps. Let Y be the proportion of the surface of the sphere not covered. Then 
Y = \x N {Pi)dwi where dwi is an element of surface area surrounding P x . Then 

(4.54) E(Y) = ^-\E[x N (P i )]dw 1 = E[x N (P 1 )] = ^-^f 

and the second moment is 

1 r r 

E[x N (P 1 )x N (P 2 )] dw 1 dw 2 . 



(4.55) E{Y 2 ) = -^- 2 

107T 

Let Pi and P 2 be separated by an angle 0, and let /(<&) denote the combined 
surface areas of the caps of angular radius a centered at Pi and P 2 . A random 
cap covers one of the two points if its center falls on the region of area /($). Thus 
the probability that one random cap covers one of the points is /(<l>)/(47r). The 
probability that N random caps leave both Pi and P 2 uncovered is 

(4.56) E[x N (Pi)x N (P 2 )] = [l -—/(*)]" 

Since E[x N (Pi)x N (P 2 )] depends upon Pi and P 2 only through <£> we can 
evaluate E(Y 2 ) by using the following trick to integrate over d$> instead of dw\ 
and dw 2 . The integral 



I E[x N (Pi)x N (P 2 )] dwi dw 2 



(4.57) E(Y 2 ) = —— 2 

iOTT 

can be viewed as the expected value of a function E[x N (Pi)x N (P 2 )] of random 

points Pi and P 2 which fall with a uniform distribution on the surface of the 

sphere. (Remember the expectation in E[x N (Pi)x N (P 2 )] refers to random caps, 

not the random points Pi and P 2 .) The two points P x and P 2 subtend an angle 

between <& and <i> + d<t> at the center of the sphere if and only if the second point 

falls in a ring of area 2ir sin 4> dQ (see Fig. 4.8). This event happens with 

probability 

27rsin<J>d$ 1 . , , . 
= - sin 4> d<&. 

4tt 2 

Thus we can substitute 2 sin 3> d<J> for (dwi/(4-rT))(d<o 2 /(4TT)) and we have 

(4.58) E(Y 2 ) = ~^[l-~f(<P)] N sm<S>d<P. 

When <I>g2a, the two circles around Pi and P 2 do not overlap and /(<&) = 
47r(l -cos a). When <!>< la we have (by spherical trigonometry) 

fl", 1 _i/tan|$M 1 _,/sinH>\ , n ^*.^~ . 

f«b-\ W 1 cos — cosa+-cos l [-—-) (0g<&S2a) 

(4.59) l JK ' = )> ir \ tan a /J it \sma / 



47T 



(cos a (2ag$S-7r). 



rl 



Thus E(Y ) can be evaluated by numerical integration. 
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Fig. 4.8 

We can therefore obtain the first two moments of the distribution of Y, the 
fraction of the total area not covered. This distribution will consist of a concen- 
tration of probability, Pr (N), at the origin together with continuous probability 
distribution on the range (0< F<oo) and can be written 



(4.60) 



Pr[y<y] = Pr(JV)+ [l-Pr(JV)]g(y)rfy 






where g(y) is the density of V given that Y>0. In other words, for F>0 the 
density is just [1 -Pr (N)]g(y). Let /u, and fi 2 denote the first two moments of 
the density g(y). Then 



E(Y) = (1-?t(N))ii 1 



£(y 2 ) = (l-Pr(^)) M2 



and 



(4.61) 



1-Pr 



(N)=(^ 



H2\[E(Y)] 2 



7 E(Y 2 ) 



Thus if we can find /X2fj. f, Pr (N) can be obtained from the above expression. 

To obtain ^ti and /n 2 we note that for large N the size of any uncovered area 
on the sphere's surface will be small compared with the radius of the sphere and 
with the radius of the random caps. Thus the curvature of the surface of the 
sphere can be neglected, the region can be considered to be planar, and the 
boundary of the region may be taken to be made up of straight lines. Further- 
more since the caps are uniformly distributed on the surface of the sphere, the 
straight lines are random lines of a Poisson field in the plane. Thus the area of 
any single (i.e. connected) uncovered region has approximately the same dis- 
tribution as the area, A, of a polygon formed by random lines of a Poisson field 
in the plane. We have already obtained the values of E(A) and E(A ) during our 
discussion of random lines in the plane, namely 



E(A)=tt/t\ 



E(A 2 ) = \tt 4 /t A 



where t is the intensity of the random lines in the Poisson field. 
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In the example with the random arcs on the circle we saw that for large N the 
probability of more than one disjoint uncovered area was negligible. If we make 
the similar assumption that the probability of more than one disjoint uncovered 
areas on the sphere is negligible compared with the probability of one such area, 
we get iii = E(A)and fi 2 = E(A 2 ). Then 

M2 1 2 

Mi 2 
and 

(4.62) Pt(N)=1-~tt 2 [E(Y)] 2 /E(Y 2 ). 

This formula is not satisfactory for low values of TV as it yields negative values of 
Pr (N). An alternative hypothesis is that the number of uncovered areas, M, is 
distributed in a Poisson distribution with mean v. This cannot be true when the 
random caps are large for in the cases where a s 90° there cannot be more than 
one region uncovered. For smaller values of a, however, this assumption seems 
plausible. We also assume that the areas, Y t , of the individual regions are 
distributed independently. If the total area is Y we have Y - Y\ + ■ • • + Y M and 
using the moments of the Poisson distribution we get 

E(Y)=vE(Yi) 

and 

E(Y 2 )= v E{Y?)+v\EYif. 
If we now put E(F i 2 )[£'(y i )]" 2 = W we have 

(4.63) y = |7r 2 {E(y 2 )[£(y)r 2 -l} _1 

and the probability that the random caps do not cover the sphere is 

l-Pr(JV)=l-exp(-j;) 
(4.64) 



= l-exp(-| 7 r 2 {£(y 2 )[E(y)]- 2 -l}- 1 ). 



Moran and Fazekas de St. Groth tested the accuracy of the above formula 
with an experiment using ping-pong balls. One hundred circular holes were 
punched in an aluminum sheet. The diameter of these holes was chosen so that 
the part of a ping-pong ball which may protrude through each hole is a spherical 
cap of angular radius a =53.43° (i.e. subtending a half angle of 53.43° at the 
center of the ping-pong ball). This value of a was chosen because for the 
influenza virus the radius of the sphere is 40mjii while the antibodies are of 
length 27 m//, so that the shielded area subtends a half angle of 53.43°. One 
hundred ping-pong balls were placed in the holes and were held firmly against 
the holes by a foam rubber pad. The spherical caps visible from the other side of 
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the aluminum sheet were sprayed with paint, thus simulating the dropping of a 
random cap of paint on each ping-pong ball. After drying, the ping-pong balls 
were removed, rotated randomly and replaced. Then another random cap of 
paint was added to each ball. The following table (Table 2) shows the number of 
balls not completely covered with paint after N sprayings in each of three times 
the experiment was carried out. 

Table 2 





Number of balls not covered 


N 


l-Pr(iV) 


1st expt 


2nd expt. 


3rd expt. 


10 


1.0000 


100 


100 


100 


15 


.9846 


95 


96 


97 


20 


.8009 


76 


82 


80 


25 


.4916 


53 


62 


52 


30 


.2502 


28 


32 


24 


35 


.1143 


12 


19 


11 


40 


.0489 


6 


8 


3 



An exact expression for Pr (N) is presently not known except for several 
special values of a. These special cases include the following: 

1) Pr(A0 = if Nsin 2 (a/2)S 1. This is obvious when we remember that 
sin 2 (a/2) is the fraction of the sphere's total surface covered by one cap. 

2) Pr (2) = cos 2 a if a > 90°. The portion of the sphere not covered by one cap 
is itself a spherical cap of angular radius 2tt -2a, which will be covered by a 
second random cap if and only if the center of the second random cap falls at 
great circle distance 277 - 2a or more from the center of the first random cap (see 
Fig. 4.9). 

Thus the probability that the sphere is not completely covered is just the 
probability that the center of the second random cap falls in a cap shaped area (C 



Pole of first cap 



Area left uncovered 
by second cap 




Pole of second cap must fall within C 
for sphere to remain uncovered. 



2-;r-2a 



Area left uncovered by 
first cap 



Fig. 4.9 
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in Fig. 4.9) of angular radius 2tt - 2a centered at the center of the first cap. This 
probability is just sin 2 1(277 -2a) or sin 2 a. Hence the probability that the sphere 
is completely covered is Pr (JV) = 1 — sin" a = cos" a. 

3) Pr(JV)=l-2"(/V 2 -JV + 2)whena = 90°. When a =90° each random cap 
is a hemisphere. The surface of the sphere is cut up into JV" - JV + 2 pieces by the 
JV great circles which are the circumferences of the N random hemispheres. 
(This can be seen by the same argument as was used in the derivation of E{N) 
for random lines in the plane, namely: One great circle splits the surface of the 
sphere into two pieces, and the Mh great circle intersects each of the previous 
JV- 1 great circles twice, thereby cutting 2(JV- 1) of the previous pieces in half, 
or adding 2(JV - 1 ) to the total number of pieces. Thus if M N denotes the number 
of pieces into which JV great circles cut up the surface of the sphere, we have 
Af N =Af N _i + 2(Ar-l) or M N = 2 L"i (/- 1) + 2 = JV 2 -JV + 2.) To each great 
circle there are two equally probable ways the cap could have landed and still 
have had the same great circle as a circumference. For example, the center of the 
cap could be either the North pole or the South pole with equal probability. 
Thus, given the JV great circles and the pieces into which they cut the surface of 
the sphere, each piece has probability 2 of not being covered by a certain random 
cap and probability 2" of not being covered by any of them. Since there are 
N 2 -N + 2 pieces, the probability is 2~ N (JV -N + 2) that any piece is not 
covered by any of the JV random caps. Thus the probability that the whole 
surface of the sphere is covered is 

Pr(JV)=l-2~~(JV 2 -JV + 2) 

when a = 90°. 

This result is a special case of a more general result in geometric probability 
due to J. G. Wendel (1962). He showed that if JV points are scattered at random 
on the surface of the unit sphere in n -space, the probability that all the points lie 

on some hemisphere is 2~ N+1 Y." k=0 ( ). 

In addition to these few special cases for which an exact expression for Pr (JV) 
can be developed, we can obtain upper and lower bounds for Pr (JV) which can 
be combined to yield the exact asymptotic behavior of Pr (JV) as N goes to 
infinity. These bounds were first derived by E. N. Gilbert (1965). 

An upper bound for Pr (JV) is easy to obtain. Any particular point, say the 
North pole, has probability (1 - A ) N of being left uncovered by all JV caps, where 
A = sin 2a is the fraction of the sphere's surface area which is covered by a single 
cap. Thus the probability that the North pole is covered by at least one cap is 
1 - (1 - A ) N . Since the sphere is not covered unless the North pole is covered, we 
have 

(4.65) Pr(iV)Sl-(l-A) N 

To obtain a lower bound for Pr (JV) let us define a crossing as being a point of 
intersection of the circular boundaries of two overlapping caps. See Fig. 4.10a. If 
there is an uncovered crossing, some area of the sphere outside of the two 
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overlapping caps must also be uncovered. Conversely, if at least two caps 
overlap, then the boundary of any uncovered area on the sphere must contain an 
uncovered crossing. Thus we find that the sphere is completely covered if and 
only if there are at least two caps which overlap and every crossing is covered. 





Fig. 4.10a Fig. 4.10b 

A cap intersects another cap if its center falls within a great circle distance of 
2a from the center of the other cap; that is, if its center falls in a large cap of 
angular radius 2a centered at the center of the other cap (see Fig. 4.10b). This 
event has probability equal to the fraction of the sphere's area which is covered 
by the large cap of angular radius 2a, namely, sin~ a. Clearly for every pair of 
overlapping caps there are two crossings. Thus the expected number of crossings 

between two particular caps is 2 sin 2 a. If we drop N caps, there are I 1 or 

N(N-\)/2 distinct pairs of caps and the expected number of crossings between 
all caps is 

N(N - 1) sin 2 a = 4N(N - 1) sin 2 j cos 2 1 

(4 - 66) =4N(N-l)A(l-\). 

The probability that a crossing will not be covered by the remaining A" — 2 caps is 
(1 — A) N ~ 2 . Thus the expected number of uncovered crossings between N caps 
on the sphere is 4N(N - 1)A(1 -\) N ~\ If we denote w(k) as the probability that 
the N caps leave exactly k crossings uncovered, we have 

(4.67) X kw(k) = 4N(N-\)\(\-X) N "\ 

k = \ 

Let U(N) denote the conditional expectation of the number of uncovered 
crossings given that not all crossings are covered. Thus 

(4.68) U(N)= £ kw(k)/(l-w(o)). 

k = l 

Therefore we have 

(4.69) \-w(o) = 4N{N-\)\{\-\) N ^/U{N). 
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But w{6) is the probability that all crossings are covered, which is the same as 
Pr(iV). Also, £/(/V) = 3 since an uncovered crossing occurs only if there is an 
uncovered area on the sphere, which requires at least three crossings in its 
boundary. Thus we have 

(4.70) Pr(N)^l-^N(N~l)A(l-\) N '\ 

For small N this is not very good, in fact it can be negative. On the other hand, 
this lower bound can be combined with our previously obtained upper bound 
to get 

log(l-A)sllog(l-Pr(AT))S^log(l-A)+^log[|iV(N-l)A]. 

Since the limit of the second summand on the right is zero, we have the 
asymptotic behavior of log (1 -Pr (AT)), i.e., 



(4.71) 



lim AA" 1 log(l-Pr(N')) = log(l-A) 

N-»oo 



-io.(i--.'f). 



CHAPTER 5 

Crofton's Theorem and Sylvester's Problem 
in Two and Three Dimensions 



Crofton's theorem on mean values. It is to M. W. Crofton, an English 
mathematician whose works appear in the latter half of the 19th century, that we 
are indebted for approaches by which results in geometrical probability that 
normally would require difficult and sometimes intractable integration are 
achieved by ingenious artifices. His methods permit the evaluation of some 
definite integrals without directly performing the integrations. In previous 
chapters we have referred to some of his work. We now discuss some other 
situations, develop Crofton's very useful mean value theorem, and apply it in 
some classical problems such as Sylvester's four point problem — namely, find the 
probability that four points taken at random within a given boundary shall form 
a re-entrant quadrilateral. An especially pertinent article by Crofton is the one 
he prepared for the 9th edition of the Encyclopaedia Britannica (1885), 
particularly the section entitled "On Local Probability". We now reproduce 
Crofton's results. 

Let us now consider a few specific questions. Assume that a specified space S 
is included within a given space A. Let a point P be taken at random on A and 
let p be the probability that it falls on S or we may write 

p = S/A. 

Now assume the space S is variable and that it has n equally probable values 
Si, S2, S 3 , • ■ • , S„. Then the chance that any one S, say Si, is taken and that then 
P falls on Si is 



>i = (£) (Si/ A). 



Let p = p\ + p 2 + ■ ■ ■ + p„ and so 

p.,, ,-( * + **,- +a - )(l)- M( wo 

where M(S) is the mean value of S. 

The chance of two points falling on S given that they fall at random on A is 
similarly 

(5.2) p=M(S 2 )/A 2 . 
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where M(S 2 ) is the mean value of the random variable S 2 . Note in these two 
illustrations that if the probability is known, the mean value follows, and vice 
versa. 

Let us explore this in connection with the following problem. Consider a line 
of length / and the distance XY between two points falling at random on / — here 
/ is A and XY is S. We wish to determine M(XY)", namely the nth moment of 
the random distance XY. To do this, consider the probability that n (additional) 
points taken at random on I fall in XY. Then 

(n + l)(n + 2) 

for the probability that X is one of the extreme points out of (n + 2) points is 
2/(n+2), and, if it is, the probability that Y is the other extreme point is 
l/(n + 1). However, we also have 

M(XY) n 

r 

and so 

21" 



(5.4) p = 
and so 

(5.5) M(XY) n ■ 



(n + l)(n+2) 



Let us consider another illustration by which the relationship between mean 
values and probabilities can be exploited. A line of length / is divided into n 
segments by (n - 1) points taken at random on the line. We wish to find the mean 
value of the product of the lengths of these segments. Let a, b, c, • • • be the n 
segments in one particular case and let n new points be taken at random over the 
line. The probability that one falls on each segment is 

.a b c , abc ■ ■ ■ 

n\- 



'l 11 ' l" ' 

Now we seek the same probability no matter how the line is divided by the 
(n - 1) random points. This is 

(5.6) Y M{abc " " " } - 

Now the number of different arrangements in which all (2n - 1) points may occur 
on the line is (2n - 1)!. Out of these arrangements, the number in which a point 
of the first series falls between every two of the second series is from per- 
mutation theory equal to («!)(« — 1)!. Thus 



and 

(5.7) 



n\ 
—M(abc 


n\(n-\)\ 
)_ (2n-l)! 


M(abc ■ • 


•> ='"-?>. 



(2»-l)! 
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We shall now derive an important formula known as Crofton's formula or 
Crof ton's theorem on fixed points. Let n points fi, f 2 , • ■ • , in be randomly 
distributed on a domain S and let H be some event which depends on the 
positions of the n points. Let S' denote a domain slightly smaller than S and 
contained in S. Denote by A5 the part of S not in 5'. If we take AS small enough 
that we may neglect the probability of two or more of the £, falling in A5, we get 
(where P now stands for probability) 

P[H] = P[H \£ u 6, • • ■ , & e S'] P[f ,, 6, • • ■ , in e S'] 

(5.8) 

+ X P[H | ii e AS, I e S' for all / * j) Pfe g AS] P[f e S' for all i* /]. 

Let s, s' and As denote the measures (e.g., area, volume, etc.) of S, S' and AS 
respectively. Since the points are random in S 

(5.9) P[£gAS] = - 

s 

and 

(5.10) P[f 1 ,6,---,fneS'] = [y] n . 
Then we get 

+ nP[//|f i e AS, f 2 , • • • , i n eS']s' n ~ l AS. 

Since the domain AS is taken to be small enough that we can throw away the 
higher order terms of AS, we have 

(5.12) s" = (s' + As)"ss'" + m'""' As. 

(These approximations will become exact when we let As become infinitesimal.) 
Thus we have 



(5.13a) 
or 


P[H](s'" + ns ,n - i te) = P[H\gu&,- ■ • .gneSW 

+nP[H | f t e AS, 6, ■ ■ • , in £ S'K"' As 


(5.13b) 


s'(F[Hl-P[//|f„f 2 ,--,^GS']) 

= nAs(F[//|f lG AS,6,---,^eS']-P[//]). 



We now let As become infinitesimal. Then S' is approximately S so that s' 
approaches s, the difference P[W] — P[H \i\, f 2 , ■ • • , in £ S'] is a small increment 
5P[//], AS becomes 8s, and P[// 1 i j e AS, f 2 , • • • , i„ e S'] becomes the proba- 
bility P[H\iieSS] that H occurs when one of the random points is on the 
boundary 8S of S. Thus we arrive at Crofton's formula 

(5.14) 8P[H] = n(P[H\i l e8S]-P[H])s' 1 8s. 
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As a simple example of an application of Crofton's formula suppose £i and £ 2 
are random points on an interval of length L. Let A" = |£i~£>l and let F(x, L) 
denote the cumulative distribution function of X, that is F(x, L) = P[X < x]. By 
Crofton's formula 8F(x, L) = 2(P[X <x\£i at end of interval] -F(x,L))L~ l SL 
since £ 2 is uniformly distributed on the interval 

(5.15) P[X < x | £i at end of interval] = x/L 
and (we assume dF(x, L)/dL exists) 

(5.16) ^^ = l(x/L-F(x,L)) 

or 

L 2dF(±L) +2LF(xL) = 2 ^ 
dL 

-^(L 2 F(x,L)) = 2x, 

(5.17) L 2 F(x,L) = 2xL + constant 

For L = x we want F(x, L)= 1. Thus the constant is -jc 2 and 

(5.18) F(x,L) = j2(2Lx-x 2 ). 

There is a form of Crofton's formula which applies to mean values. If A" is a 
random variable which depends on the positions of n random points 
£u £2, • • • , £n in S and if S' and A5 are as before, we have for small AS 

E(X) s E[X I €1, £2, ■ ■ ■ , tn e S')P[Z U &, • • ■ , £„ e S'] 

(5 19) 

v ' ; + £ E(* I £ e AS, 6 e S' f or all i * /)P[£ e AS] P[£ e S' for all i * ;] ; 

(=1 

if, as before, s, s' and As denote the measures of S, S' and AS respectively, we 
have 

E(X)s"sE(X\£ u £ 2 ,---,£ n eS')s' n 
(5.20) 

+ nE(X |6 6 AS, &,••■,& e S V As 

and by the same arguments as before we get 

(5.21) SE(X)= n[E(X\£ 1 <=8S)-E(X)]s~ 1 8s. 

A more general form of Crofton's theorem has been provided in recent years 
by Ruben and Reed (1973). This extension concerns problems in which points 
are chosen at random in each of a number of domains. Assume for / = 1, • • ■ , k ; 
Dj is a domain in n^-dimensional Euclidean space with content V,, and that JV, 
points are chosen at random from Dj. Now let ^ be the expected value of some 
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function of £ ; = 1 Nj points that depends only on the intrinsic properties of the 
points and their relative positions and not on the domains D t or the positions of 
the points relative to them. Then the extended Crofton theorem states that 

* dV 

(5.22) dn=ZN l (rf-n)-j7 i 

where dfi is the increment in /u obtained by increasing such D t by an infinitesi- 
mal increment dV„ and /j. * is the expected value of the function where one point 
is chosen at random in the boundary in £>,, Nj - 1 points are chosen at random in 
D, and N points are chosen at random in D t for / ^ j, i = 1, 2, • ■ • , k. 

As in the original Crofton theorem a direct corollary is that if P is the 
probability that £,-=i N, points satisfy some intrinsic property then 

* dV 

(5.23) dP = I N,{Pf-P)^. 

This result follows from the extended theorem by considering an indicator 
function with values 1 or if the property is or is not satisfied. 

The extended theorem allows repeated application, and this enables the total 
degrees of freedom of points selected at random in, for example, an n -simplex or 
polyhedron, in n -space, to be reduced by n + 1. The expected value of a function 
of points can be decomposed into a linear combination of expectations, the 
degree of expectation in each of which has been reduced by n + 1. In the original 
Crofton theorem, the degrees of the freedom of the points is reduced by one, 
and thus the degree of integration involved in finding the expectation is reduced 
by one. 

Sylvester's four point problem. Many complex problems are solved through 
the use of Crofton's formula. One example is the famous Sylvester four point 
problem, which we shall now consider. In the late nineteenth century the English 
mathematician J. J. Sylvester posed the problem of finding the probability that 
four points taken at random inside a convex domain D form a re-entrant 
quadrilateral, that is, one of the points lies inside the triangle formed by the 
other three. A re-entrant quadrilateral can be formed from four points in four 
different ways, according to which of the four points occurs inside the triangle 
formed by the other three. If the domain D has area A D and the mean area of a 
triangle formed by three random points X, Y and Z in the domain is 
E(AreaXYZ) then the probability Pr [RQ] of a re-entrant quadrilateral is 

(5.24) Pr[RQ] = 4E(AreaAYZ)M D ; 

clearly we can reduce or expand the scale of the domain D and the ratio 
E(AreaXYZ)/A D remains unaffected. (In this section, we use Pr to denote the 
probability.) Therefore the probability of four random points forming a re- 
entrant quadrilateral in D is the same as if the four points were random in a 
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slightly smaller domain with the same shape. Thus we have (5Pr[RQ] = 0; in 
Crofton's formula 

(5.25) SPr [RQ] = 4(Pr [RQ|one point on boundary] -Pr [RQ])Ad 8A d 
which implies that 

(5.26) Pr [RQ] = Pr [RQ| one point on boundary]. 

Let O denote the random point on the boundary 8D of D. Given that O e SD, a 
re-entrant quadrilateral can be formed from three random points in D and 
O e 8D in three ways, according to which of the three random points in D occurs 
inside of the triangle formed by O with the other two random points. Thus if 
£"(Area OXY \Oe8D) denotes the mean area of a triangle formed by X and Y 
random in D and O random on 8D, we have 

(5.27) Pr [RQ] = Pr [RQ | O e 8D] = 3£(Area OXY | O e SD)/A D . 

We shall now evaluate ^(Area OXY | O e8D) and Pr [RQ] for several types of 
convex domains D. 

Let us first suppose D is a triangle, ABC. Figure 5.1 shows the triangle ABC 
together with the random point O on the boundary. Let Si denote the part of 
triangle to the left of the line AO and Si the part to the right of AO. We shall 
use the subscript O in expectations such as £ (Area OXY) to indicate that the 
expectation is over the distributions of only X and Y and that O is treated as a 
fixed point on the boundary. Thus for any fixed point O on the boundary the 
expected area of the triangle formed by O and two random points X and Y in 
ABCh 

E (Area OXY) = £ (Area OXY \X, Y e 5,) Pr [X e Si] Pr [ Y e Si] 

+£o(Area OXY \X, Y e S 2 ) Pr [X e S 2 ] Pr [ Y e S 2 ] 
(5.28) 

+Eo(AieaOXY\XeS u Y eS 2 )Pr [XeSi] Pr [YeS 2 ] 

+£o(Area OXY\Xe S 2 , Y e 5,) Pr [X e S 2 ] Pr[YeSi]. 
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If we denote the areas of S u S 2 and ABC by s u s 2 and A D respectively, we get 
Eo(Area OXY) = £ G (Area OXY\X, YeS^i/Ab 

(5.29) +E (Ar<zaOXY\X, YeS 2 )sl/A 2 D 

+2Eo(AreaOXY\XeS u Y e S 2 )siS 2 /Ad- 

In order to calculate E G (Area OXY) we must evaluate the conditional 
expectations on the right hand side of the equation. We shall evaluate 
E (Area OXY \ X e S u Y e S 2 ) first. Clearly the expected value of the distance 
from a random point X e Si to a line not passing through 5i is the distance from 
the line to the center of gravity Gi of Si. Then for a fixed point Y eS 2 the 
expected area of the triangle OXY over all positions of X e Si is 

£o(Areaoxyjx e Si, y=F () ) = ^l|om|£o(l|oy () toXi||r=yo) 

(5.30) =^||Onlll|OV„toG 1 || 

= AreaOY" () Gi 

where ||OYo|| denotes the length of the line segment OY and ||OY toGi|| 
denotes the perpendicular distance from Gi to the line OY n . The area of the 
triangle OYoGi can also be written 

(5.31) E (AreaOXY\XeS u Y = Y ) = ^||OG,|| ||OGi to Y ||. 

The mean value £ (Area OXY\X e Si, Y e S 2 ) is the expectation of 
E {AreaOXY\XeS u Y = Yo) over all Y eS 2 . Thus 

E (ATeaOXY\XeS u Y e S 2 ) = E (£o( Are a OXY | A'eS,, Y)) 

= ^||OG 1 ||£ (l|OG 1 toy||) 
(5.32) 

= ^||OG,||||OG,toG 2 || 

= Area OG x G 2 

where G 2 is the center of gravity of S 2 . 

We shall now evaluate the area of the triangle OGiG 2 . (See Fig. 5.2.) We shall 
show that 

(5.33) Area OG x G 2 = - Area ABC. 

We shall also show that 

2 

(5.34) Area AG X G 2 = - Area ABC 
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Fig. 5.3 



a fact which we will need later on. We use the fact that the three medians of a 
triangle are concurrent at the center of gravity, which is two thirds of the way 
along each median from the vertex to the opposite side. Hence Gi is the point of 
intersection of the medians of ABO and G 2 , the point of intersection of the 
medians to AOC. Since d and G 2 are the same fraction, 2/3 of the way from A 
to the line BC, the line GiG 2 must be parallel to BC. Furthermore the line G\G 2 
extended intersects any line perpendicular to the base BC (for example the 
broken line in Fig. 5.2) two thirds of the way from A to the line BC. Thus we 
have 



(5.35) 



Altitude OG1G2 = - Altitude ABC 
3 
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and 

2 

(5.36) Altitude AG, G 2 = - Altitude ABC. 

Since G, occurs two thirds of the way along the median BM (see Fig. 5.3), the 
projection of BG, on the base BC is two thirds of the projection of BM on BC. 
Similarly the projection of CG 2 on BC is two thirds the length of the projection 
on BC of the median CM. Hence the projection of G\G 2 , which is BC minus the 
projections of BGi and CG 2 , is one third the length of BC. Since G\G 2 is 
parallel to BC, we have 

(5.37) IIGjGjH \\\BCl 

Combining this result with our previous results for the altitudes of OG\G 2 and 
AG\G 2 we have 

(5.38) Area OG,G 2 = - Area ABC, Area AG, G 2 = - Area ABC. 
Thus we have 

(5.39) E (AreaOXY\XeSi, YeS 2 ) = AreaOG 1 G 2 = -A D . 

We shall now evaluate E (Area OXY \X, Ye Si). We must first prove the 
following fact. For any two triangles OLR and O'L'R' with random points X 
and Y in OLR and X' and Y' in O'L'R', we have 

£(Area OXY) _ EjAreaO'X'Y') 
( " ' ' Area OLR ~ Area O'L'R' ' 

This fact is not immediately apparent but can be seen in the following way. The 
triangle OLR can be transformed into O'L'R' by projecting it obliquely onto 
another plane and changing the scale. The projection and the change of scale 
multiply the area of the inner triangle OXY and the total area of OLR by the 
same factor. Thus if X and Y denote the images of X and Y under the 
projection and scale change, we have Area OXY/ Area OLR = 
Area O'XYj Area O'L'R'. The projection and scale change preserve the uni- 
form distribution of X and Y, that is, X and Y are randomly distributed in 
O'L'R' and have the same distribution as X' and Y' . Hence 

£(Area OXY) = £(Area OX' Y') 
Area OLR ~ Area O'L'R' 

We shall now use this fact to derive .E(Area OXY) for two random points X 
and Y falling in a triangle OLR. Let M denote the midpoint of the side LR and 
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let Vi and V 2 denote the areas to the left and right, respectively, of the median 
OM. (See Fig. 5.4.) Then we can write 

E(Area OXY) = £(Area OXY\X, Ye VjXPr [Xe V,]) 2 

+ E(Area OXY\X, Y e V 2 )(?i [X e V 2 ]) 2 
(5.41) 

+ E(AreaOXY\XeV u YeV 2 )Pr[XeV 1 ]?r[YeV 2 ] 

+ E(Area OXY\X e V 2 , Ye V^Pr [Xe V 2 ] Pr [Y e V,}. 
Since OM bisects the triangle, 

(5.42) Pr[A'6Vi] = Pr[A'6V 2 ] = i 

and 

E(Area OXY) = ~[E(Area OXr|X, Ye V,) 

(5.43) +E(Area OXy|X, Ye V 2 ) 

+ 2E(Area OXY\Xe V u Ye V 2 )]. 
Since the areas of V^ and V 2 are each one half the area of OLR, we have 

E(Area OXY) = E (Area OXY\X, Ye V 2 ) 
' Area OLR ~~ 1 Area OLR 

which implies 

(5.45) E(Area OXY\X, Ye Vi) = if (Area OXY) 
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for / =1,2. Substituting i£(Area OXY) for £(Area OXY |X, Y e V,) and 
£(Area OXY | X, Y e V 2 ) in our equation for £( Area OXY) we get 



E( Area OXY) = - 

4 



-E(Area OXY) +^£(Area OXY) 



2 
(5 - 46) + 2£ (Area OXY JXe V,, Ye V 2 ) 

which yields 

(5.47) £(Area OXY) = ^£(Area OXY \ X e V x , Y e V 2 ) = = Area Or, T 2 

where Ti and I 2 denote the centers of gravity of V) and V 2 respectively. We 
have already found the area of a triangle of this type to be 

~> 

(5.48) Area OY K T 2 = ^ Area OLR. 

Therefore 

(5.49) £(Area OXY) = ^- Area OLR. 

Applying this result to the two triangles S, and S 2 in our original triangle ABC 
we get 

£o(AreaOXY!X YeS,) = ^rs,, 
(5.50) 

£o (Are a OXY |X, YeS 2 ) = ~ s 2 . 

We can now return to our expression for the expectation of the area of OXY 
conditioned on the position of O on the boundary of the triangle ABC. 

Eo(AreaOXY) = £o(AreaOXYiX, YeS^/^D 
+E (Area OXY jX, YeS 2 )sl/Ab 
(5.51) +2E (AreaOXY|XeS 1 , Y eS 2 )s { s 2 /A 2 D 

__ J_ 



4 , 4,2 
^ s i +xz:^t +~A D Sis^ 
27 27 " 9 



Up to now we have considered O as a fixed point on the boundary of the 
triangle ABC. Now we will average over all possible positions for O on the 
boundary in order to obtain the unconditional expectation of the area of the 
triangle OXY, which we can then use to find Pr [RQ], the probability of four 
random points forming a re-entrant quadrilateral. We need average only over 



108 CHAPTER 5 



possible positions for O on one side, say BC, of the triangle because the situation 
is identical for O on each of the other sides, AB and AC. 




Fig. 5.5 



Now suppose the length BC is b and the point O lies at a distance x from B. 
Furthermore, denote the height of A above the base BC by h. (See Fig. 5.5.) 
Then 

111 
(5.52) A D =-bh, Si = -xh, s 2 = -(b-x)h, 

and the expected area of OXY regardless of the position of O on the boundary 
is 

(553) Z> 3 /i 2 L216 + 216 + 72 108 J 

Thus the probability of a re-entrant quadrilateral being formed from four 
random points in a triangle is 

Pr[RQ] = Pr[RQ|Oe<5£>] 
(5.54) =3E(OXY\OeSD)/A D 

_1 

The method by which we have obtained Pr [RQ] for four random points in a 
triangle can be extended to derive Pr [RQ] for four random points in any convex 
polygon. Suppose the polygon has n sides. Then we can take a point O uni- 
formly distributed on one of the sides. From O we can draw lines to each of the 
vertices of the polygon thus dividing the polygon into n - 1 triangles, 



CROFTON'S THEOREM AND SYLVESTER'S PROBLEM 



109 



Si, S2, • • • , S„-i (see Fig. 5.6). For any fixed position for the point O we can 
calculate 

Eo(Area OXY) = \ E Q (Area OXY \X, Y e S,) Pr [X, Y e S t ] 

(5 - 55) 'I-!™-! 

+ 11 £o(Area OXY | X e S„ y e S,) 

i = l ; = 1 



Pr[XeS,]Pr[yeS ; ]. 




Fig. 5.6 



Alikoski (1939) has solved the Sylvester problem for all regular rc-gons 
employing the results obtained above for the triangle. He proves that 



(5.56) 



Pr [RQ] = 



9 cos w + 52cosw+44 
9n sin a> 



and 



We now present his development. Let Pq,P\, ■ ■ ■ ,P n -i be a regular »-gon, 
denoted together with its area by T. Choose a rectangular coordinate system so 
that the polygon lies above the jc-axis, the side P n -\P Q is on the x-axis, its 
midpoint is the origin. (See Fig. 5.7.) Let (x„, y„) be the coordinates of P v . We 
choose a point P of abscissa x on the side P„-iP and denote by T^ v (x) the 
triangle PP M P„ and its area. Now take two points X and Y in T at random and 
let M(x) be the mean of the area of the triangle PXY. We further denote M„{x) 
the mean area of the triangle PXY when only those triangles are included for 
which both X and Y belong to T v -i iV (x), and M„,„(x) the mean area when one is 
in Tn-i <IL (x) and the other in T v -i fV (x). 

We can now write 



(5.57) 



M(x)= I ( 



'/^i,W 
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If Q v is the center of gravity of the triangle r y -i.„(jc) and R„ the midpoint of 
P v -iP v , then we have from our previous results for the triangle, 

4 
M„(x) = —- ■ T„^i, v (x), 



M M ,„(x) = APQ^a = - ■ kPR^.. 



Denoting 

(5.58) 

we can write (5.57) as 



T^(x)=APR„R, 



T 2 -M(x) = ^ n Z[T^Ux)l 3 



(5.59) 



8 " -1 " _I 



Using the expression of the area of a triangle as a determinant, we get (jj. < v) 



T„Ax) = : 



1 



1 1 


l 


x x^ 


x v 


y. 


y* 



or T^„(x) = -(A^ v + x ■ B^„), 



where 
(5.60) 



x^ x v 

y*i y- 



B^ = y^~y r . 
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in 



or 



%A*) = : 



1 



1 



1 



1 



x -(Xn-i + xJ -(x t ,-i+x,.) 

1 1 

o ^(y.-i + yj ^(y..-i + y,.) 



where 

(5.61) 



T„Ax) = -(A^, + x-B„,„l 



A-V..V ~ ^(i-l.f-1 "•" A„ ( ,,-1 + A^- 1 .,. + A,,,!,, 



Noticing that r„,,,(x) is identically zero, we can write (5.59) in the following 
form: 

T 2 ■ M(x) = ~"l (A v - l .„+x ■ B„^, v f 
54 ,. = i 

(5.62) +-^V £ (A,, ,,,+.*•£?,.-,. ,,)(A M - ,.,,+.* -B M -,. J 

•(A^.-f.v •£*„.,.). 

7he right hand side in (5.62) is a polynomial in x of the 3rd degree at most. 
But since M{—x) = M(x) by symmetry, the terms of even degree must be zero, 
so that we may write 

(5.63) T 2 -M(x) = a+(3x\ 
A simple transformation yields: 



(5.64) 



J n- 1 I n- I . 

54 „=i 36 v -.-.\ M = i 

i8= — "l A l .- l , i ,Bl- l .,-+—\ B, i., I (B M -,. fi A A ,.„ + 2A^ 1 . M B M .„). 
lo , =i 3d ,,= i M -l 



The expression for a comes directly from (5.62) while that for /3 can be deduced 
from (5.62) as follows. From (5.62) it follows that 



(5.65) 



18 „=i 36 ,.= i (i-i 

36 „=.-i M =i Jo ,.= i M = i 
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In order to get the expression above for /3 we have to show that the last two 
sums — which we denote by Si and S 2 — are equal to one another. In fact we have 

(5.66) 5 2 = I I B^A^uJ^. 
But since B~^ v = -B Kll 

(5.67) I t ^-1.^-1.^,, = 
so that, since B^ v = 0, 

rt — 1 ft _ 

(5.68) 52 = — £ L Bv-i^Ap-i^Bw 

Interchanging the letters fi and v, and noting that fl„, K - ~B^ V , we get S2 = 5i. 

Let M be the mean of M(x), when P is a randomly chosen point on P„-iP . 
We get: 

- 1 r*° 

(5.69) M = — M(x)dx, 

Xo Jo 

that is 

(5.70) M = ^(3a+t3x$). 

If we let M be the mean of the area of the triangle XYZ where XYZ are 
randomly chosen inside the polygon, then it can be shown 

4M = 3M or 4M = -fA 

T 2 

Therefore we get: 

(5.71) Pr[RQ] = 3a^l 

We chose the radius of the circle circumscribed to the polygon as unit length and 
so we get 

(5.72) <p=-. 

n 

Then the coordinates of P v are 

(5.73) x v = sm (2i> + l)<p, y„ = cos <p- cos (2^ + 1 )<p. 
Upon putting those values in (5.60), we get: 

(5?4) ^..„,[*,(V + lV-*.P» + lW + *.2(,- < .^ 
•B^,„ = cos {2v + l)<p -cos (2/i 4- l)<p, 
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and hence 
(5.75) 



^^-1,^. = 2 sin 2<p sin yup, 



Bfi-i,^ = —2 sin <p sin 2fi<p. 

Upon substituting these values in (5.61), after simple transformations we get: 

A M ,„ = 16 cos 2 <p sin v<p sin [x<p sin (v-fx)(p, 

B^„ = 4 cos <p(cos 2v<p -cos 2 pip). 

In order to compute <p we put these values in the above expressions for a and j3 
and then write (5.71) in the form 

« • sin itp 

Using De Moivre's formula, we see the result for Pr [RQ] becomes (5.56). 

In order to show that Pr [RQ] decreases with increasing n, take the derivative 
of Pr [RQ] with respect to <w and we get 

(5.77) ^ = ^^_. 5 , 

da> 1(577 sin a) 

where 

(5.78) S = sinw(9cos to +52 cos w +44) — «(26cos w + 53 cos w+26). 
A power series expansion yields: 

(5.79) S = <o 5 -^a> 7 + ^ 9 - + ---. 

This is an alternating series and since for n g 5 (a> § 2ir/5) the absolute value of 
its terms is decreasing, we get for these values of n : 

(5.80) S>co 5 -j-a> 7 . 

Since the right hand side, and consequently d Pr [RQ]/da> is positive for ng5, 
Pr [RQ] decreases as n increases beyond n = 5. Noting that the values of 
Pr [RQ] are also decreasing for n = 3, 4, 5, we find the assertion is proved. 

If we let n-»oo, then w-*0 and Pr [RQ]-35/(12tt 2 ). The following values 
(Table 3) for Pr [RQ] had been determined by other investigators previously but 
substitution for n =3, 4, 6, 8, oo in the Alikoski result will provide the same 
answers. 

Some other problems in geometrical probability can be solved by noticing that 
their solutions depend on the solution to the Sylvester four point problem. For 
example, if two points A and B are taken at random in a convex area, suppose 
we seek the probability that two other random points C and D lie on opposite 
sides of the line AB. Once three points A, B and C have been placed, the 
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Table 3 



Triangle 


Square 
(Parallelogram) 


Regular 

Hexagon 


Regular 
Octagon 


Circle 
(Ellipse) 


1 


11 

36 
.3056 


289 
972 

.2973 


1181+867v2 
4032 + 2880v 2 

.2970 


35 


3 
or 

.3333 


12,r 2 
.2955 



situation is shown in Fig. 5.8. The quadrilateral formed by the four random 
points A, B, C and D is convex if and only if the fourth point D falls in an area 
marked CQ. Clearly the regions marked CQ have the same mean area by 
symmetry. There are two areas marked CQ on the same side of AB as C and 
one area on the other side. Thus if we know that the quadrilateral formed by A, 
B, C and D is convex, the probability is | that the fourth random point falls on 
the opposite side of AB from C. 




Fig. 5.8 



On the other hand, the quadrilateral ABCD is re-entrant if and only if the 
fourth point D falls in an area marked RQ. It does not seem intuitively obvious 
that the regions marked RQ have the same mean area, but this fact can be shown 
by the following considerations. If the fourth point falls in the topmost region 
marked RQ then the point B will be the re-entrant vertex of the quadrilateral. If 
the fourth point falls in the central region marked RQ, it will itself be the 
re-entrant vertex of the quadrilateral. Each one of the random points is equally 
likely to be the re-entrant vertex of the quadrilateral. Thus the mean areas of the 
regions marked RQ must be the same. There are two areas marked RQ on each 
side of the line AB. Thus the probability is \ that the fourth point D falls on the 
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opposite side of the line AB from the third point C. Hence the probability that C 
and D fall on opposite sides of the line AB is 

Pr [opposite sides of AB] = Pr [opposite sides | convex] Pr [convex] 

+Pr [opposite sides | re-entrant] Pr [re-entrant] 

(5.81) = *(l-Pr[RQ]) + ^Pr[RQ] 

= HPr[RQ] 

where Pr [RQ], the probability of a re-entrant quadrilateral being formed from 
four random points in a convex region, is the solution of the Sylvester four point 
problem. 

We have just shown that if K is any convex region and Oi, Q 2 , O3, and Q 4 are 
four random points in K, the probability Pr [RQ] that the four points form a 
re-entrant quadrilateral satisfies the relation 

(5.82) Pr [RQ] = 4 -6 Pr [same side of QiQ 2 ] 

where Pr [same side of Q1Q2] denotes the probability that Q 3 and Q 4 fall on the 
same side of the line OiQi- 

To evaluate Pr [same side of Q1Q2}, suppose the line O1Q2 has direction 
and distance p from some origin. This line splits the area A oi K into two 
sections with areas, say, <r(p, 0) and cr'(p, 0). If f(p, 0) is the joint density of the 
random variables p and 0, then 



(5.83) Pr [same side of OiQ 2 ]= [ 



cr 2 (p,0) + <r' 2 (p,0) 



A 1 A" 



f(p,6)dpd0 



which, by the symmetry between the roles of <x(p, 0) and cr'(p, 0), reduces to 
(5.84) Pr[same side of Q1Q2] =-3 = j j <r 2 (p, 6)f(p, 6)dpd0. 



\-i 



We shall obtain f(p, 0) as the limit as Ap -> and A<? -» of (A6> Ap)~ times the 
probability that the line O1O2 has direction between and + A.0 and distance 
between p and p + Ap from the origin. For any given position of Q u the line 
joining Qi and Q 2 has a direction between and + A0 if and only if Q 2 falls in 
the shaded region shown in Fig. 5.9. This happens with probability 

(5.85) Pr [direction Q { Q 2 e (0, <? + A0)| <?,] = — — + — . 
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A'_ 



Fig. 5.9 



Integrating this expression over a uniform probability distribution for all posi- 
tions of Oi in the shaded strip in Fig. 5.10, and multiplying by the probability 
that Oi falls in the strip (c(p, 8) Ap/A), we get 



Pr 



(5.86) 



Q1O2 has direction between 
6 and + A8 and distance 
between p and p + Ap from origin 

A9 rCip - e) 
1A 



f 

Jo 



[r 2 + (c(p,d)-rf] 



dr (c(p,d)Ap 



c(p, d) 



/ c(p,6)Ap \ 



Ad Apc 3 (p, 6) 
3A 2 ■ 




Fig. 5.10 
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Thus 

3 

(5.87) f(p,0) = ^-2 



3A' 



and 



2 r t 

(5.88) Pr [same side of OiQ 2 ] = j^i\ J cr 2 (p, 0)c 3 (p,6)dpd6. 

Calculating Pr [RQ] from Pr [same side of QiOz], we obtain the following: 
Theorem. Three points are taken at random in a convex region K with area A. 
The mean area of the triangle formed by these three points is 

(5.89) E(A) = A-j- 3 ^o- 2 (p,6)c 3 (p,e)dp 



de 



where 6 and p are the parameters of a line intersecting K, c(p, 0) is the chord length 
of that line within K, and a(p, 6) is the area in K on one side of the line. If a fourth 
point is taken at random in K, the probability that the four random points form a 
re-entrant quadrilateral is 



(5.90) Pr[RQ] = 4-—i 



cr 2 (p,6)c 3 (p,d)dpde. 



Sylvester's problem in three dimensions. Results analogous to those presented 
in the preceding section were obtained by Bohnslav Hostinsky (1925) for the 
three dimensional analogue to Sylvester's problem. Consider five points Qi, Q 2 , 
Q 3 , Q 4 and Q 5 independently and uniformly distributed over the volume of a 
convex region K in three-dimensional space. What is the probability that one of 
the points falls within the tetrahedron formed by the other four points? To 
obtain an expression for this probability, we shall trace Hostinsky's steps and 
shall first derive the probability that Q 4 and Q s fall on the same side of the plane 
passing through Q\, Q 2 and Q 3 . 

To calculate this integral, let us change our variables of integration from 

Xi, X2, X3, Y\, Y2, Y3, Z\, Z.2, Zt, 
to the new variables 

x[, X2, x 3 , y[, yi, y 3 , u, v, w 
where 

x[ = x u X2 = x 2 , X3 = x 3 , yi' = yi, y2 = y2, ya = y 3 
and u, v and w are the parameters of the plane 

ux + vy + wz + 1 = 
passing through the points Q x , Q 2 and O3. In other words, «, v and w are the 
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solution of the linear system of equations 

uxi + vyi + wz\ + 1 = 0, 

(5.91) ux 2 + v y 2 + wz 2 + 1 = 0, 

ux 3 + vy 3 + wz 3 + 1=0. 

The Jacobian of this transformation reduces to 



(5.92) 



w 

zli 

w 



-x 2 

w 

-yi 
W 



-x 3 

w 

zli 

w 



— [ux[ + vy[ + l] —[uxi + vyz + l] 



1 



^[ux'-i + vy'z+l] 



which, in turn, reduces to 

(5.93) 
or 



w 



x[ 


X 2 


xi 


yi 


yi 




l 


l 


l 



— \x[y 2 - x 2 y[ + x^y'i- x'-iy 2 + xiy'i- x[y'i\. 
w 

It is a well-known fact in analytical geometry that the area of a triangle whose 
vertices are (x[, y[),'{xi, yi) and {x' 3 , yi) is 

(5.94) ^p=2\x[y2-X2y[ J rX2yi~x 3 y2 + x' i y[-x[y3\. 

Thus our element of integration dx\ dy x dz x • • • dx 3 dy 3 dz 3 becomes 

2Ap 



(5.95) 



w 



dx[ dy[ ■ ■ ■ dy 3 du dv dw 



where Ap is the area projected onto the X-Y plane by the triangle QiQ 2 Q 3 . 
Multiplying and dividing by (u 2 + v 2 + w 2 ) 2 , we have 

(5.96) 



* — •-*) A 

u +v" + w\ 4 A dxidy[ ■ ■ ■ dy 3 dudvdw 



(u 2 + v 2 + w 2 ) 2 

We note that dudv dw/(u 2 + v 2 + w 2 ) 2 is the density for planes in space which is 
invariant under rigid motions of the coordinate axes. We can rewrite this density 
as sin <p dp dip dd where (p, <p, 8) are the spherical coordinates of the closest point 
on the plane to the origin. 
Then 



(5.97) 



sin <p cos 6 



v =- 



sin (p sin 6 



w =■ 



cos (p 
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and 

(5.98) 



'vu'+ v 



w 



4 
COS (p 



Since <p is the angle between the normal to the plane and the z-axis, we see from 
Fig. 5.11 that any increment d t d y of area on the plane projects onto the x-y 
plane as dx dy /cos <p. Thus our element of integration 



(5.99) 



becomes 
(5.100) 



2A/7 dx[dy[ dx 2 dy'2 dx'y dy'->, 



cos <p cos <p cos <p COS <p 



sin <p dp dip d6 



2\ d£\ dy 1 d^ 2 dy 2 d£ 3 dy 3 sin <p dp d<p dd 



where A is the area of the triangle QiQiQy, p, <p, and 8 are the coordinates of 
the plane determined by Qi, Q 2 , and Qy, and f k and y k are the coordinates of 
the point Q k (Ar = 1, 2, 3) with respect to a system of coordinates situated in the 
plane Q1Q2Q3. 




Fig. 5.11 



The plane Q1O2Q3 cuts K into two parts with volumes, say, X and 2,'. The 
measure of all positions of Q 4 and Q 5 such that they fall on the same side of the 
plane O1O2O3 is 

(5.101) [J J {{[ dx 4 dy 4 dz 4 dx s dy,dz, = l 2 + l' z 

and, since dx\ dy\ ■ ■ ■ dz?, is 

(5.102) 2A ^1 dy x d£ 2 dy 2 rffi dy 3 sin <p dp dcp do, 
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the probability that Q 4 and Q 5 fall on the same side of the plane O1O2O3 can be 
written (where P denotes the probability) 



P[same side of Q1Q2Q3] 
1 



(5.103) 



_2_ 



dx\ dyi 



\\\\\\\^ 



dz<, 



' +1! ) d£\ dy\ dg 2 dy>2 d^ dyi dp dw 



where dw denotes the increment in solid angle sin <p dip d6. 

The value of 1 2 + ~L' 2 depends only on the position of the plane Q1Q2Q3. In 
other words, the term (X +1' ) can be factored out of the integrations with 
respect to dfr and dy t . The integral 



(5.104) 



fj 



A d£ x dyi d£ 2 dy 2 d^ dy 3 



is just A 2 E(A), where E(A) is the mean area of a triangle formed by three 
random points in the area A of the convex section of the plane Q1Q2Q3 within 
K. We have already obtained a formula for £"(A), namely 



(5.105) 



E(A) = A 



-7J 



c a dq dip 



where a is the area of one of the parts into which the area A is divided by the 
line (q, ip), and c is the length of that line within the area A. The integration is 
extended over all secants of the area A. Hence we have 



(5.106) 



Pfsame side of Q1Q2Q3] = 775 



V 5 J 



f A 3 E(A)& 2 + Z' 2 )dpdv 
| A 4 ^cr 2 c 3 dqd<p] 



■[L 2 + Z' 2 ]dpdw 



where the integrations with respect to q and </r extend over all lines in the plane 
(p, <p, 6) which hit the curve of intersection of that plane with K, and the 
integrations with respect to the coordinates of this plane extend over all planes 
cutting K. 

We shall now proceed to find the mean volume of the tetrahedron T formed 
by four points Qi, Q 2 , O3 and Q 4 taken at random within K. 

The planes containing the faces of the tetrahedron T divide the volume V of 
K into 15 parts. A first part T is formed by the interior of the tetrahedron T. In 
order to designate the other parts, let t, represent the plane containing the face 
of T opposite the vertex Q„ and define the negative side of t, to be the side on 
which the vertex Q, is situated. For any permutation (1, /, k, m) of (1, 2, 3, 4), we 
denote by T t the part of K on the positive side of r, and on the negative sides of 
Tj, T k , and r m ; by Ti, the part of K on the positive sides of t, and t, and negative 
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sides of T k and r m ; and by T ijk the part of K on the positive sides of t„ t, and r k 
and the negative sides of r m . Using this notation, we can list the 15 parts of K 

(5.107) 

tO, 1 1, •« 2, i 3, •« 4; J 12, •« 23, i 31, i 14, i 24, I 34 - , i 123, •« 124, i 134, i 234- 

If a fifth point is taken at random in K, designate by po, Pi, Pa or p jk the 
probabilities that the fifth point falls in T , T t , T t j or T ijk , respectively. By 
symmetry, we have 

Pl=P2 = P3=P4, 

(5.108) Pl2=P23=P31=Pl4=P24=P34, 
Pl23 ~ P\24 = P\34 = P234, 

and, since the fifth point must fall in one of the 15 sections of K, 

(5.109) Po + 4pi + 6pi2 + 4p 123 =l. 

Let Q 4 be a fifth random point in K, and let U be the tetrahedron with vertices 
Qi, Qz, C?3 and Qs. The volume of K is divided by the planes of the faces of the 
tetrahedron U into 15 parts which we can represent 

(5.110) 

U , U u U 2 , Ui, U 5 ; U l2 , U 23 , U ilt U l5 , U 25 , U 35 ; U m , U i2 s, U 13S , U 23S , 

where U h f/;„ and U ijk are defined analogously to T t , T L , and T ijk . 

We can obtain relations between the t/'s and 7"s in the following way. From 
Fig. 5.12 we see that if Q 4 and Q 5 are on the same side of the plane Q1Q2Q3, 
then Q s and d fall on opposite sides of the plane O2O3O4 if and only if Q 4 and 
Oi fall on the same side of the plane O2O3O5. By Symmetry, the same statement 
is true with Oi interchanged with Q 2 or 03- Hence Q s is in T l23 (i.e., on the 
opposite side of Q2Q3Q4 from Qi, the opposite side of O1Q3O4 from Oi, the 
opposite side of Q1Q2Q4 from C? 3 and the same side of O1Q2Q3 as Q 4 ) if and 
only if 4 is in U (i.e., on the same side of Q2Q3Q5 as Qi, the same side of 
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QiQiQs as Q 2 , the same side of Q1Q2Q5 as Q 3 and the same side of Q1O2Q3 as 
O4). Thus 

(5.111) Pl23=P0- 

Similarly Q 5 is in T\ (on the opposite side of Q2Q3Q4 from Q\, the same side 
of Q1Q3Q4 as Q2, the same side of Q1Q2Q4 as Q 3 , and the same side of 
Q1Q2Q3 as Q 4 ). Thus 

(5.112) Pl=P23- 

Since the indices are interchangeable, 

Pl2=P3 

and, since by symmetry p 3 = p 1; we have 

Pl2=Pl- 

Combining the three equations 

(5.113) Po + 4pi+6pi2 + 4pi23= 1, Pl23=Po, Pn=Pu 

we get 

(5.114) 5p + 10pi = l. 

The points O4 and Q$ fall on the same side of the plane Q1Q2Q3 if and only if 
Qs falls on the negative side of the plane t 4 , i.e., if and only if Q 5 falls in a region 
Ti, Tij or T,,fc where neither / nor j nor k equals four. Thus 



(5.115) 



P[samesideof Oi0 2 03] = Po + Pi + P2 + P3 + Pi2+P23 + P3i + Pi23 



= 2p + 6pi. 
Eliminating pi from the last two equations, we get 



(5.116) 



Po = --P[same side of Q1Q2O3] 



which, together with our expression for Pfsame side of O1Q2O3], leads to: 

Theorem. The mean volume of the tetrahedron whose vertices are four random 
points in a convex region K of volume V is 



(5.117) 



E(V T )=v[~ 



V 5 
_2_ 



A 3 £-(A)(2 2 + r 2 )JpJw 



4 



co- dqdtp 



[l 2 + l' 2 )]dp 



dw 



where A is the area of the intersection of K with the plane having coordinates 
(p, w) [i.e., (p, if, 8)], E(A) is the mean area of a triangle formed by three random 
points in that area, X and X' are the volumes of the parts into which the plane 
divides K, and the integrations over dp dw extend over all planes intersecting K. In 



CROFTON'S THEOREM AND SYLVESTER'S PROBLEM 



123 



the second expression c is the length of intersection of the area A with a line having 
coordinates (q, i//) in the plane (p, w), a is the area within A on one side of that 
line, and the integrations over dq d\\> cover all lines in the plane (p) which intersect 
K. If five random points are taken within K, the probability that any one of the five 
points falls within the tetrahedron formed by the other four is 



(5.118) 



/ , [RP] = 3--^ J 



f A 3 E(A)(1 2 + T 2 )dpdw 



= 3- 



10 



A 4 - 



„ 3 2 
c a 



dqdtl/)(l 2 + T 2 )dpdw 



(where RP denotes re-entrant polyhedron). 

We can apply these general results to the case where the region K is a sphere 
of radius r. A plane at dist ance p from the center of the sphere intersects the 
sphere in a circle of radius v/- 2 -p 2 and area 

A = w(r 2 -p 2 ). 

From our discussion of Sylvester's problem for random points in a circle, we 
recall that the mean area of a triangle formed by three random points in a circle 
is 35/(4877-) times the square of the radius of the circle. Thus for three random 
points on the circle of intersection of the sphere with the plane at distance p from 
the sphere's center, the mean area of the triangle formed is 



(5.119) 



*(A) = £VV). 



The volume 1 in the sphere on the side of the plane, say, away from the 
sphere's center is 



(5.120) 
Thus 

(5.121) 



Z = ^7r(r-pf(2r + p). 



l 2 + -L' 2 =lir 2 [Sr 6 -4r 3 (r-p) 2 (2r + p)+(r-pf(2r+p) 2 } 



= -77- [4r +9r p -br p +p J 



and the formula 



(5.122) P[same side of Oi0 2 3 ] = ^5 \\\ E(A)A 3 (2 2 + X' 2 ) dp dw 
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becomes 

Pfsamesideof 1 2 3 ]=^75 f l~-)(r 2 - p 2 )W(r 2 - p 2 )f 

XiTTT ) J_ r \4o7r/ 

(5.123) • ^ir 2 (4r 6 + 9r 4 p 2 -6r 2 p 4 + p 6 )] dp 

_ 84 
~ 143' 

Thus when K is a sphere, the mean volume of the tetrahedron formed by four 
random points in K is 

3 



(,m> B(vrf - „>(-£)-!„>(£)-: 



and the probability that five random points in a sphere form a re-entrant 
polyhedron is 

9 

143' 



(5.125) P[R?] = 5E(V T )/{^7rr 3 ) 



For convex regions other than the sphere, Hostinsky's formula for P[RP] does 
not seem to be tractable. Explicit values of P[RP] for random points in non- 
spherical regions such as tetrahedrons, parallelepipeds, etc., have apparently not 
yet been successfully calculated. On the other hand, from the relation 

l>P[RP] = 5p 
we have 

1 

Po<- 5 . 

In other words, the mean volume of the tetrahedron whose vertices are four 
random points in K is less than one fifth the total volume of K, no matter what 
shape K has (so long as it is convex). 
It should also be noted that the above bound on p and the relation 

3 

(5.126) p = --i'[samesideof O1O2O3] 

yield the bounds 

(5.127) ^<P[samesideof0 1 2 03]<^ 

regardless of the shape of K. 

For the case of four random points being taken in a convex region K on the 
place, Sylvester conjectured and Alikoski proved that the probability P[RQ] 
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that the four points form a re-entrant quadrilateral is minimized when K is a 
circle and maximized when K is a triangle. The analogous conjecture for the 
three-dimensional problem is that the probability F[RP] that five random points 
in K form a re-entrant polyhedron is minimized when K is a sphere and 
maximized when K is a tetrahedron. This conjecture remains unproven. 

Kingman (1969) considers a generalization of Sylvester's problem in n 
dimensions. If n + 2 points are chosen independently at random inside K, then 
the convex hull of these points is a convex polytope with either n + 1 or n + 2 
vertices. We want to know the probability of its having n + 1 vertices. This 
probability is (n +2)E{A(x ,xi, ■ ■ ■ ,x n )}/V, where A(x , Xi, ■ ■ ■ , x n ) is the 
volume of the convex polytope formed by the points x , ■ ■ ■ , x n . In particular, it 
is found that for an n -dimensional ball, 

(5128) s^^.^ : ^ /( ^ )r 

If we substitute n = 3, it follows that the probability that the convex hull of five 
points chosen uniformly inside a sphere has four vertices, is 9/143 and confirms 
the Hostinsky result in three dimensions for the sphere. 
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CHAPTER 6 

Random Chords in the Circle and the Sphere 



Techniques have recently been developed whereby cells in mitosis may be 
photographed. For the normal human cell the resulting picture, called a 
karyograph, displays 22 pairs of like chromosomes plus two sex chromosomes, 
like in the female and unlike in the male. See Fig. 6.1. Abnormal cells may have 
more or less chromosomes than these. The positions of the chromosomes on a 
karyograph present an interesting problem in geometrical probability; namely, 
to what extent is the position of each member of a pair of chromosomes random 
in the cell nucleus with respect to the other member of the pair. For example, are 
the chromosomes of a pair located closer to each other in general than would be 
two points randomly dropped in the cell nucleus? This latter question was 
investigated by Barton, David and Fix (1963), who used data from 70 kary- 
ographs to test the hypothesis that the chromosomes of a pair are randomly 
distributed within the picture of the cell nucleus versus the alternative hypothesis 
that there is a force of mutual attraction (or repulsion) between the two 
chromosomes in each pair. 




nucleus wall 



Fig. 6.1 
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To retrace the analysis done by Barton, David and Fix, we shall need to 
derive the distribution of the distance between two points randomly dropped on 
a picture of a cell nucleus. For simplicity we will assume the boundary of the cell 
nucleus in a karyograph is a circle. Actually the nucleus is only approximately 
spherical and its boundary often appears to the camera more like an ellipse. We 
shall make some allowances for this ellipticity later on in our analysis, but for the 
momemt we shall assume the boundary is a circle and shall derive the probability 
density function of the distance, x, between two points randomly dropped on a 
circle of radius R. Let P denote the probability that the two points are separated 
by a distance between x and x + Ax. Let Pi denote the same probability when 
one of the points is known to be on the circumference of the circle. Then by 
Crofton's fixed point theorem we have 

(6.1) dP = 2{Pi-P)V- 1 dV 

where V in this case is the area of the circle, that is, V = ttR and dV = 2ttR dR. 
We see from Fig. 6.2 that when one point is on the circumference, the second 
point must fall in a section of an annulus (shaded in Fig. 6.2) for the two points to 
be separated by a distance between x and x + Ax. For infinitesimal Ax, the area 
of this annulus section is 2<f>x Ax, where <f> can be seen from the figure to be 
cos -1 (x/(2R)). Hence the probability that the points are separated by a distance 
between x and x + Ax is 

„ 2x Axco^ 1 (x/(2R)) 

(6.2) Pi = -J^- * 

Substituting into Crofton's formula we have 

"2x Ax cos -1 (x/(2R)) 12 dR 



dP 



_ 2 pxAxcos L (x/(2R)) r 
L ttR 1 



R ' 

Rearranging terms we get 

„ ,„ <„ Jr . 8x Ax cos^ 1 (x/(2R)) 

RdP + 4PdR= ^^ -, 

ttR 

and multiplying both sides by R , we get 

„4 ,„ .„a„ J „ 8jc Axi? cos -1 (x/(2R)) 
R 4 dP + 4R 3 PdR= — -, 

TV 

and integrating both sides, we obtain 
p/?4 = 4 f _ 2 Axf2i? 



(6.3) 



Jt«-&)« 



where C is a constant to be determined. For R = x/2, the two random points 
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would have to fall on the circumference diametrically across the circle from each 
other. This is, of course, a zero probability event. Thus when R — x/2, p = 0. 
This implies C = 0. Thus we have derived the probability density for the distance 
between two random points dropped in a circle: 

1= 2fri ro w-^-^/i. "^ 



(6.4) p{x)-- 



(s)-sv- 



'R 2 \.ir"~ \2Rl ttR v ~ 4R 
This is the density of x under the null hypothesis (randomness) 



(0<jc<2/?). 




first point 



Fig. 6.2 



For the alternative hypothesis we suppose there is a force of attraction (or 
repulsion) between the chromosomes of a pair. We shall describe this force in 
terms of its effect on the distribution of x, namely that the force skews the null 
hypothesis density, p(x |H ), by a factor of x a for some power a > —2. Thus the 
density of x under the alternative hypothesis, Hi, is 

(6.5) p(x\H 1 ) = Kx a p(x\H ). 

In order that pix |Hi) integrate to unity we must take for the value of K 

(<*+2)tt (a+2)ir 



(6.6) 



K 



2° j3(3/2, l/2(a +3)) 2" +4 /3(3/2, (a +3)/2) 



where f3(a, b) denotes the Beta function, T(a)Yib)/Yia +b). Thus 
(a+2V a+1 



(6.7) p(*|H,) = 



2 a+ ' , /3(3/2,(a+3)/2)i?- 



- cos "(^)-^v 



77 



AR 



'<}■ 



Negative values of a shift the density to lower values of x; that is, negative 
values of a correspond to a force of attraction and positive values of a to a force 
of repulsion between the chromosomes of a pair. 
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Suppose n pairs of points are dropped with the same force of attraction 
between the members of each pair but with the members of different pairs 
independent of each other. Let us define 

D=txf 

where x t denotes the distance between members of the /th pair. From our 
expressions for p(x |H ) and p(x |Hi) we get 



(6.8) 



E[x l H °J- (k + 1)(fc + 2) U + 1 )' 

E t 2 km , 2 2k R 2k (<* + 2)r((a+3)/2 + k)r(a/2 + 3) 
lX ' lJ (a + 2k+2) r((a+3)/2)r(a/2 + 3 + fc)' 



Thus we have 



E[D\H ) = nR 2 , 
Varp|H ] = yi? 4 , 



(6.9) 



K 3 [D \H ] 



nR 6 



nR* 



2 
X 4 [D|H ] = 

where K r is the rth cumulant. Now 

i2 (a + 2)(a+3) 



£[D|Hi] = 4/xi?' 



(a+4)(«+6)' 



(6.10) Var[D|H l] = 16n^ ( ? + 2 j ( X +3) ^ +32) 



^ 3 [£>|Hx] = - 



(a+4f(a+6) 2 (a + S) ' 

2 9 /? 6 «(a+2)(a+3X5a 4 + 54a 3 + 52a 2 -672a-1440} 
(a + 4) 3 (a + 6) 3 (a + 8) 2 (a + 10) 



For the values of n which we are considering (23 for each karyograph), the 
standardized cumulants of D of order three and above are sufficiently small 
under both H and Hi that the distributions of D under H and Hi can be 
approximated by normal distributions with the means and variances given 
above. For example when a = 1, we get y x = 0.5154h~ 1/2 , y 2 = -0.5277n 5 for n 
reasonably large. Yet the normal approximation may be inadequate when a = 
-1, for then yi = 1.7612n~ 1/2 , y 2 = 2.9345« _1 and are not that close to zero. 

One advantage of making this normal approximation is that it reduces the 
amount of work necessary to investigate the effect of noncircularity of the 
nuclear walls. Since the normal distribution is completely specified by its mean 
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and variance, we need only investigate the effects on the first two moments when 
the boundary is an ellipse and not necessarily circular. 




z sin S 



px cos 



Fig. 6.3 



Consider an ellipse of area ttR 2 (i.e., the same area as a circle of radius R) and 
let p (0<p < 1) be the ratio of the smaller to the larger axis. This ellipse can be 
regarded as the result of a scaling down in one direction, by a factor p, of a circle 
of radius R/vp, the direction being that of the minor axis of the resultant ellipse. 
Any pair of points A' and B' separated by a distance x' in the ellipse will have 
arisen from two points A and B separated by some distance x in the circle (see 
Fig. 6.3). Define 8 as the angle between the line AB and the direction which is 
scaled down when we transform the circle into the ellipse. Suppose points A and 
B have fallen in the circle either randomly (H ) or with an attraction or repulsion 
between them (Hi). Then all values of 8 between and n are equally likely 
under both Ho and Hi. The transformation from the circle to the ellipse leaves 
the component of x parallel to the major axis unchanged, while the component 
parallel to the minor axis is multiplied by a factor of p. Thus we have 



E[x' 2k ] = E([(;c sin df + (px cos 6») 2 ] k ) 



(6.11) 



= E[x 2k }- 

IT J 



' 7r k /k\ 

in 

;=oV// 



p ' cos ' 6 sin 



2k -2 j 



ode. 



Since cos 2 ' 6 sin 2fc 2 ' 6 is symmetric about tt/2, we need integrate only from to 

■nil. 

,2 * /fc\ 



(6.12) 



£n»B[x a ]-i(Vf" 

irj= \]l J 



co& 2i zm 2k ~ 21 8 dB 



-^UOM^-^)- 
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Thus for k = 1 and k = 2 we get 

£[*' 2 ] = (l/2)(l+p 2 )£[* 2 ], 
(6.13) 

E[x' 4 ) = (l/S)(3 + 2p 2 + 3 P 4 )E[x 4 ]. 

For a circle of radius R we have already obtained 

(6.14) E[x>\Ho] = R\ i^lHJ^f^^). 

(a + 4)(a + 6) 

(6.15) 

E[x 4 |H ] = (5/3)K 4 , E[x*| Hl ]-16^|2L±^±7j^L±3) 

(a + 6)r(2[a + 3])r(5a + 5) 

But the circle in which the points A and B are dropped has radius /?/vp. If we 
replace i? by i?/vp in the above expressions for E[x 2 |H ], ij[jt 2 |Hi], £ , [x 4 |H ] 
and E[x 4 |Hi] and substitute these results into our expressions for £[x' 2 ] and 
E[x' ] we get 

Lp J (a+4)(a+6) 
£[*' 4 |Ho] = ^[A + 2 + 3p 2 ]* 4 , 

T3 ,1 4 r((a+7)/2)r(a/2 + 3) 

Hence, if n pairs of points fall in an ellipse with major axis /?/vp and minor axis 
vp R (that is, an ellipse whose axes have the ratio p (0 < p < 1) and whose area is 
the same as that of a circle of radius R ) and if 

D'=i x ; 2 

where x\ is the distance between members of the /th pair, then 



(6.16) 
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Using normal approximations with the above means and variances for the 
distributions of D' under H and Hi together with data from 70 karyographs, 
Barton, David and Fix found that the random hypothesis is tenable for normal 
male cells and for abnormal male and female cells. On the other hand the 
random hypothesis is not tenable for normal female cells. The maximum likeli- 
hood estimator for a in that case is typically -.4 or -.5 indicating an attraction 
between chromosomes of homologous pairs in the normal female cell. 

One question still to be investigated is whether the results of the randomiza- 
tion model used in the above analysis (points random in a circle) would differ 
very much from the results of a randomization model in which the points are 
random within the volume of a sphere and are then projected onto a plane, thus 
yielding a different kind of "random" points in a circle. This latter model would 
seem to better reproduce the physical condition of the three dimensional cell 
nucleus projected by the camera into the two dimensional karyograph. Since this 
latter model would tend to cluster the "random" points closer to the center of 
the circle, we might find that the randomness hypothesis under this latter 
randomization model would be tenable for normal female cells as well as the 
male and abnormal cells. 

Random chords in the circle. Another approach to the chromosome problem 
is to study the lines joining the chromosomes of each pair and see whether the 
number of intersections of these lines within the nucleus wall could have arisen 
through some random mechanism. This approach was taken by David and Fix 
(1964). For simplicity the nucleus is taken to be circle of unit radius. Each line 
drawn through a homologous pair of chromosomes forms a chord of our circular 
nucleus. For the chromosome problem there are 23 such chords, but for 
generality we shall suppose we have n chords in a circle. The maximum number 

of intersections which these chords make within the circle is ( J, that is, every 

chord intersects every other chord within the circle, and ( I is the number of 

pairs of chords which can be formed from a set of n chords. It is a feature of the 
chords drawn through the chromosomes of homologous pairs that the number of 

intersections within the circle averages about 80% of ( J. We want to know 

whether 80% of ( I is a typical number of intersections for random chords 

generated by some randomization model which is consistent with the physical 
set-up. We shall consider six different randomization models by which random 
chords in a circle can be formed. For each of these models we shall derive the 
mean and variance of the random variable r, the total number of intersections 
within the circle between the n chords. Another way of measuring how 
frequently the random chords intersect within the circle is to count the number 
of intersections, w, which the n random chords make within the circle with an 
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additional random chord placed in the circle according to the same random 
mechanism as the other n chords. We shall find the mean and variance of w as 
well as r for each of our randomization models. 

Define a random variable a;,- to be the indicator function of the event that the 
ith and /th chords intersect within the circle, that is, 

1 if chords i and / intersect within the circle, 



[0 otherwise. 

Clearly, E(aa) is just the probability that two particular chords intersect within 
the circle. We can represent w, the number of intersections with one particular 
chord (say chord O), as 

(6.19) w=ia io 

and 

E(w)= nE(a io ), 



(6.20) B(w 2 ) = E(ia io i^o) 

V=i i = i / 



= nE (a io ) + n(n-\)E (a io a io ), i i* j. 

The other random variable of interest, r, the total number of intersections within 
the circle, can be written 

(6-21) r = \ I £ «„• 

^ i=i y=i 

so that we have 

(6.22) E(r)^n(n-DE(au), 



E{r 2 ) = -E(i U I I «fc») 

4 \, = 1 , = 1 k = \ m = \ ' 



I 

ivy 



If"" 

~ A E I I (fXipLij + aifLu 

4 Li = i, = i 



) 



n n n 



+ L E I (ctijetik+ctijCtki + «.-,«,* ->raija kj ) 

i = l ; = 1 fc = l 
;*i,i*fc,fc?«j 



(6.23) 



n n n n ~| 

+ X I I I CtijOtkm 
i = \ 1 = 1 k = l m=l J 



i = l ; = 1 k = \ m=l 
no two indices alike 



= -[2n(n - l)E( aii )+4n(n - \){n-2)E((x ii a ik ) 

4 i*k 

+n(n - l)(n -2)(n -3)E( ai/ a km )]. 

i*j*k*m 
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Since a„ and <x km are independent random variables when none of the indices 
are the same, we have 



E(a i ja km )={E{a ii )f 



and 



(6.24) 



i¥-\*k*m 



E(r 2 ) = \ [In {n - 1 )£(<*,,) + 4n (n - 1 )(« - 2)£(a,,a lfc ) 



+n(n-l)(n-2)(«-3)(£(a„)) 2 ]. 



Thus in order to evaluate the first two moments of w and r we need only 
evaluate fj(a,y), the probability that two random chords intersect within the 
circle, and Eiayajk), j ^ k, the probability that two random chords each intersect 
a third random chord within the circle. 

To derive the probability that two chords intersect within the circle, we shall 
first derive the probability that two chords of given lengths intersect within the 
circle. The most convenient parameter by which to specify the length of a chord 
is half the angle subtended by the chord (0 in Fig. 6.4). Since our circle has unit 
radius, the arc intercepted by the chord has length 26. 




Fig. 6.4 



Suppose we have two chords with half angles di and 6 2 and suppose 6\ > 6 2 . 
Once chord 1 has been placed, chord 2 will intersect chord 1 if and only if either 
the counter-clockwise endpoint of chord 2 falls in shaded region A (see Fig. 6.5) 
or the clockwise endpoint of chord 2 falls in shaded region B. Each of these 
events has probability 26 2 /(2tt). Since 9 2 < $i, it is impossible for both events to 
happen simultaneously. Hence 



P[A\JB} = 



26 2 



TT 
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chord 1 



Fig. 6.5 



Thus we have obtained 



Pfintersection 1 6 X , 6 2 ] = — min (0 U 2 ) 

IT 



or, in terms of ai 2) we have 
(6.25) 



E(ai 2 |0i,0 2 ) = --min(0i,02). 

IT 



We can calculate E(ai 2 ) by integrating the above expression 

ir/2 • ir/2 



(6.26) 



E(a l2 ) = -\ min(0 l ,0 2 )f(0 l )f(0 2 )d0 l d0 2 

v Jo Jo 



77 J 



r/2 



o 2 f(d 2 )f(0i) de 2 de, 



where f{9) is the probability density function of the random variable 6 (i.e. 
(A0)' 1 times the probability that the random chord will have a half angle 
between d and + A0 for infinitesimal A0). 

For the second moments of w and r we must calculate E(a l2 a 13 ). When the 
lengths of chords 1, 2 and 3 are known (equivalently, for given U 2 and f? 3 ), the 
event that chords 1 and 2 intersect within the circle is independent of the event 
that chords 1 and 3 intersect within the circle. Thus we can write 



(6.27) 



E(a 12 a 13 1 0i, 2 , e 3 ) = £(<*i2 1 «i, 02)£(«i3 1 0i, fl 3 ) 
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and 



7I-/2 . -rr/2 . tt/2 



2d0 3 



, n/2 . -n/l 



2 
77 J 

4 f' 



- TV I Z. ^ TV/ Z. - 777 ^ 

£(a 12 a 13 )= E(a 12 \0 u 2 )E(a 13 \6i,d 3 )f(6 l )f(d2)f(d 3 )de 1 d8 

Jo Jo Jo 

min (0,, 2 )min (0!, 3 )f(d l )f(6 2 )f(e 3 )d0 l d0 2 d0 3 

f ' f ' e 2 e 3 f(e 3 )f{e 2 ) de 3 do 2 

LJo Jo 

W2 .9, 

6 1 e 3 f(d 3 )f(e 2 )d0 3 de 2 

9i Jo 



2 , 
77 J 



(6.28) 



8, ,.ir/2 



e 2 e 1 f(e 3 )f(e 2 )d0 3 de 2 



J«j 

72 -tt/2 



f(e 1 )d$ l 



6{f(e 3 )f(e 2 )de 3 d0 

4 f /2 r/f 9 i \2 r v/2 r e i 

= — 02/(^)^2+20! 02/(02)^02 B 3 f{e 3 )d6 

77 J L\J / Jfl, Jo 

+0i 2 (j f(o 2 )de 2 y 

77 J 



/(fli)de, 



r/2 r , 0, .-rr/2 n2 

02f(e 2 )de 2 +o i f(e 2 )de 2 



f(e i )de 1 



Clearly /(0) depends on what randomization model we use to define a "ran- 
dom chord." We shall now consider six randomization models and derive the 
probability density /(0) for each model. Then we shall be able to calculate E(ctjj), 
E(aijaik), ] ^ k, and the first two moments of w and r for comparison with 
observed values taken from karyographs. The first five models appear in David 
and Fix (1964) and the last model is developed by Dufour. 

Model I. The simplest randomization model is to specify each chord by its 
endpoints, which we take to be uniformly and independently distributed over the 
circumference of the circle. The density of 6 is, for infinitesimal A0 the proba- 
bility of the event that half the angle subtended by the chord is between and 
+ A0, or, equivalently, of the event that the arc intercepted by the chord has 
length between 20 and 20 + 2 A0. (See Fig. 6.6.) This event occurs if and only if 
the chord's second endpoint falls at a distance between 20 and 20 + 2 A0 along 
the circle from the first endpoint. Since there are two such infinitesimal arcs 
where the second endpoint can fall (see Fig. 6.7) and each arc has length 2 A0, 
the probability of this event is 



P[(8, + A0)] = 4A0/(2tt). 
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28 + 2\« 



Fig. 6,6 




Fig. 6.7 



Now f(0) is 

Thus 
(6.29) 



f(0)= lim (\9) l P[(O,e + A0)]. 



f{6) = ~ (OS0S7T/2). 

7T 



Model II. Any chord of a circle (except a diameter) is completely specified by 
the length and direction of the line segment from the center of the circle to the 
midpoint of the chord. For this randomization model let the length, p, of this line 
segment be uniformly distributed on [0, 1] and let the angle <f> which this line 
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Fig. 6.8a 



Fig. 6.8b 



segment makes with some axis be uniformly distributed on [0, 2n]. (See Fig. 
6.8a.) A chord will intercept an arc of half angle between and 8 + Ad if and 
only if p falls between cos (8 + A8) and cos 8 (see Fig. 6.8b). The probability of 
this event is cos (#)-cos (8 + Ad) which is approximately sin(8)A8 for very 
small A8. Thus in this case the density of is 



(6.30) 



f(0) = sind (O§0S<7r/2). 



Model III. For this model suppose that a point is dropped at random in 
the circle and the chord is drawn which has this point as its midpoint, that 
is, the chord is drawn perpendicular to the line joining the random point with 
the center of the circle. (If the perpendicular to the chord from the center of the 
circle has length p and makes an angle <f> with some prescribed axis, then the 
joint density function of p and 4> is (1 / ir)p dp d<f>, O^p^ 1, 0S$^27r.)Then a 
chord will intercept an arc of half angle between 8 and 8 + A8 if and only if its 
midpoint falls in the annulus between the circles of radius cos 8 and cos (6 + A8) 
(see Fig. 6.9). The probability of this event is w[cos 2 0-cos 2 (O + A0)]/ir which, 
for infinitesimal A8, is 2 cos 8 sin 8 A8. Thus 



(6.31) 



/(0) = 2cos0sin0 (0^8^tt/2). 



Model IV. Let us now define a random chord by dropping a point at random 
in the circle and drawing a chord through that point in a randomly chosen 
direction. Suppose the random point falls at a distance between x and x + Ax 
from the center of the circle. This event has a probability equal to the area of an 
annulus of radius x and width Ax, divided by the total area of the circle, that is, 
27m; Ax/tt or 2x Ax. Thus the density of x is 2x for OSi^ 1. Given that the 
random point falls at a distance x from the center of the circle, the random chord 
intercepts an arc of half angle between 6 and 8 + A0 if and only if the random 
chord falls in one of the shaded regions A and B shown in Fig. 6.10, that is, if 
and only if the random direction falls between two chords which intercept arcs of 
28 and 28 + 2 A8 and which cross at a distance x from the center of the circle. 
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Fig. 6.9 



The region B in Fig. 6.10 is just a reflection of region A. Therefore the total 
angular measure of the region AUB about the random point is just twice the 
angle A/3 of the region A. Thus the probability which we seek is just 2 A(3/tt. The 
two arcs si and s 2 in Fig. 6.10 must satisfy 



(6.32) 



Si-s 2 = 2Ad. 




16 r 216 



The relation between s x , Si and A/3 is evident from a more detailed version of 
Fig. 6.10. First it should be noted that the shorter chord makes an angle 6 with 
the circle. For small Ad the longer chord is almost coincident with the shorter 
chord and it makes an angle approximately 8 with the circle. Thus the width at 
the top of the region A is approximately s\ sin 6 when Ad is very small. This 
width can also be written as A/3 times the distance from the random point to the 
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top of the region A. (See Fig. 6.11.) Thus we have approximately (exactly for 
infinitesimal A0) 



(6.33) 

and similarly 



A/3 (sin 6 + V;t 2 -cos 2 <9) = Sl sin 



A/3 (sin 6 - Vx 2 -cos 2 0) = s 2 sin 6. 
Solving for si and s 2 and substituting into Si — s 2 = 2 A(? we get 
[sin 0+\/;t 2 -cos 2 sin^-Vx -cos 6 



(6.34) A/s[- 

which yields 



sin 6 



sin0 



l l- 



2A0 



A/3 



sin A<9 



Vx 2 -cos 2 



2 „' 



Given that the random point falls at distance x from the center of the circle, the 
probability that the random chord intercepts an arc of half angle between 6 and 
6 + A0 is 2 A/3/ tt or 



(6.35) 



P[(6,6 + A0)\x] = 



2 sin 6 Ad 

7rvx 2 -cos 2 6 




s 2 sin9 



Fig. 6.11 
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The unconditional probability is obtained by integrating the above expression 
over the density of x. We have shown that the density of x is 2x for Osl x Si 1. 
Note that we need integrate only from x = cos to x = 1 since it is impossible for 
a chord to intercept an arc of half angle $ if any point on the chord is closer than 
cos 6 to the center of the circle. Thus 



P[8,8 + A8)=\ "T""" 2 xdx 



2 sin 6 Ad 

i « fl-VJC - cos 



= — sin 8 A8 [vV-cos 2 *?] *=*<*« 



4 2 , 

= - sm 2 Ad. 



Hence for infinitesimal A8 we have 

(6.36) /(«) = - sin 2 (osiflsij). 

Model V. For the fifth randomization model let us generate our random 
chord by dropping two points randomly within the circle and drawing a chord 
through both points. The derivation of f{d) for this model is very similar to that 
for the previous model. The density of the distance x from the first random point 
to the center of the circle is, as before, 2x. Given that the first point is at distance 
x from the center of the circle, the second random point must fall in the region 
A U B as defined for the previous model in order that the chord joining the two 
points intercepts an arc of half angle between 6 and + AO. The difference 
between this and the previous model is that there our probability was the total 
angular measure of AUB divided by 2ir, whereas here our probability is the 
total area oi AUB over the area of the circle, ir. The area of a wedge shaped 
region of radius R and angle y is \R 2 y. (See Fig. 6.12.) Thus the area of the 
region A is (except for terms of order (A0) 2 ) 

Area (A ) = - (sin 9 + J x 2 - cos 2 df A/3 + - (sin 6 - Vx 2 -cos 2 0) 2 A/3 

(6.37) =( x 2 -cos 2 + sin 2 0)A|3. 

The probability that the second random point falls in the region A U B is 

(6.38) -Area(AU£) = -Area(A) = -(x 2 -cos 2 + sin 2 0)A/3. 

IT TT TT 



As in the previous model we have 



sin 6 Ad 

A P = n f=- 

Vx -cos 8 



Hence, given that the first random point falls at distance x from the center of the 
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-Area = j« 2 ? 



Fig. 6.12 



circle, the probability that the chord going through the two random points 
intercepts an arc of half angle between 6 and 6 + Ad is 



2.2 2 „ • 2 „, sin 6 A$ 

~~" n ' — 9) r 
«" Vjc z -cos z 



P[(0, + A0)|x]=-(x -cos 2 0+ sin 2 0) _ 
and, as in the previous model 

r 1 2 

P[(0,0 + A0)] = -(* 2 -cos 2 + sin 2 0) 



sin A0 
Vx 2 -cos 2 



2jc i/jc 



= — sin 6> A0 



•r 



2 
+ -sin 3 A0 

77 



(a: -cos 0) 2x dx 



2xdx 



' cosS (x 2 -cos 2 0) 1/2 



(6.39) 



?g(x 2 -cos 2 0) 3/2 ] J 



= — sin0 A0|^(x 2 -cos 2 

7T 



+ -sin 3 A0[2(x 2 -cos 2 0) 1/2 ]x=L< 

7T 

4 4 

= — - sin 4 Ad + — sin 4 Ad 
itr it 

= — sin 4 0A0. 



Dividing by A0 and letting A0 become infinitesimal we get 
(6.40) 



/(0) = ^sin 4 (Og(9g7r/2). 



Model VI. The randomization model which is perhaps most applicable to the 
distribution of chromosomes in a cell nucleus is the following. Consider the 
nucleus to be a sphere of unit radius, let two points be placed at random inside 
the sphere and project these two points onto a plane passing through the center 
of the sphere. The sphere projects onto the plane as a circle of unit radius and 
the two projected points in this circle determine a random chord as in the 
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previous model. This model is an attempt to take into account the fact that the 
camera projects the three-dimensional cell nucleus onto a two dimensional 
picture. Our method of obtaining f(6) for this model will be very similar to that 
used for the previous two models. The first random point in the sphere must fall 
in a cylindrical shell of radius x, thickness Ax and length 2v 1 -x 2 (see Fig. 6.13) 
in order that its projection lie at a distance between x and x + Ax from the center 
of the circle. The probability of this event is the ratio of the volume of the 
cylindrical shell to the volume of the sphere, namely 

P[( X , X + Ax )] J^ X ^ 2 ^ ?) 



(4/3)tt 



(6.41) 



= 3xAWl 



plane of 
projection 





Side view 



Top view 



Fig. 6.13 



Thus dividing by Ax and letting Ax become infinitesimal, we find that the 
probability density of x, the distance f rom th e projection of the first random 
point to the center of the circle, is 3jcVi-jc . Now suppose the first projected 
point falls at distance x from the center of the circle. Our random chord will 
intercept an arc of half angle between 9 and 8 + Ad if and only if the projection 
of the second random point falls in the region AUB described for Model IV. 
Thus the probability that we seek is just the volume of the sphere directly above 
and below the region AUB divided by the total volume of the sphere, \tt. By 
symmetry the volume above and below AUB is twice the volume above and 
below A. We shall get the volume above and below the region A by considering 
elements of area in A which are at a distance between 17 and 77 + A17 from the 
center of the circle (see Fig. 6.14a). For each value of rj there are two arcs whose 
lengths we shall call Afi and Af 2 such that these arcs comprise the points in A at 
distance 17 from the center of the circle. Similarly, the two thin strips of length 
Afi and At 2 in Fig. 6.14a contain all the points in A at distance between rj and 
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t} + Atj from the center of the circle. The volume in the sphere above and below 
these strips is just the area of the two strips times twice t he hei ght of the surface 
of the sphere above the strips, i.e., [Afi + Aj 2 ] At/^Vi-tj 2 ). We must now 
obtain expressions for Af i and A.t 2 in terms of r\. For brevity let p = cos 0, the 
distance from the center of the circle to the shorter of the two chords making up 
the boundary of the region A. As before, let A/3 denote the angle between these 
two chords. Let us consider the arc Awj as shown in Fig. 6.14b. 




width At) 



projection of 
first random point 



If- 



angle 
approximately 
ill for small Ap 




projection 

of first 

random point 



Fig. 6.14a 



Fig. 6.14b 



The length of Awi is 
(6.42) Aw! = (>/* 2 -p 2 + >/i7 2 -p 2 )A/3. 

For every small A/i and Avvi these arcs can be assumed linear and by the same 
argument as used in Model IV to show that the width at the rop of the region A 
is s\ sin 6 we see that 

Awi = Afi sin t/f 



where t/> is defined in Fig. 6.14b (iA = cos ' (p/17)). Equating our two expressions 
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for Awi and solving for Afi we get 



(6.43) 



ir\ -p 
In the same manner we obtain 



sin ip 

vr 



(6.44) 



so that 



At 2 = < 



Vrp^p- 



^=^(Vt, 2 - P 2 -Vx 2 -p 2 )A,8, tj>x, 



l.VT7'-p 



2r? A/3, 

Afi + Af 2 = < J 2T?Vx 2 -p 2 
Vtj -p 



TJ>JC, 



The volume of that portion of the sphere which is projected onto the two strips 
Afi and At 2 is 

Volume projected on At i and At 2 = 2[A/i + Af 2 ]v 1 - tj Atj 
(6.45) 



tj>*, 



4A/3tjV1-t/ Atj, 

. . ^W* ~P ^1 _ T7 2 , 



If we now take At/ to be infinitesimal and integrate over all values of 17 between 
p and 1 we get the total volume of the portion of the sphere which is projected 
onto the region A (except for terms of order (A/3) 2 ) 

Volume projected onto A 



= 4 A/?[ VP^ f ^^ ^ + [ \ VT^ dJ 



(6.46) 



p vi) -p 



2A(h{x 2 -p 2 )->l\-x 2 + {\-p 2 ^x 2 -p 2 \*cC x yjj- 

2 



1 1 

JL 
x 2 



+fd- *T 2 ]. 



As in the previous two models we have 

sin 6 AO sin Aff 

W = 1 1 ~T~ = n — t- 
•Jx -cos 6 six -p 
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Thus 

Volume projected onto A 



I 2 2 

L * 1-x 



1 ~~2" t ,-. „2\3/2, 



3 V* — p 



Given the distance x of the first projected point from the center of the circle, the 
probability that our random chord intercepts an arc of half angle between and 
+ A0 is 

P[(e, o + ab)\ x ^ Volum * P roject f °" to A u B 

Volume of sphere 

_ 2 (Volume projected onto A) 

, , = (4/3)71- 

(6.47) _ _ rj _, 

P 



= ^sin6beU(\-x 2 )(x 2 -p 2 ) + (l~p 2 )t3Ln-' yy 



2(l-x ) 

+ - 



2 



2\3/2, 



3 vx -r 



P 

As in the previous two models we now integrate over the density of x from cos 6 
(wh ich we are calling p) to 1. Remember that in this case the density of x is 
3*vl —x . 



P[(6, + A8)] = -sm0b9 f \j(\-x 2 )(x 2 -p 2 ) + (l-p 2 )lan~ l J^— 



2 
X 



+ - — , . , \x\l—x dx 



3 jp=p\'^ 



9 
(6.48) = — sin <9 A6> 



77 



IX —p 
Ll |(l-p 2 ) 5/2 + ^(l- p 2 ) 5/2 + ^(l- P 2 ) V2 



= — sin0A0(l-p 2 ) 5/2 

5tt 

32 
= — sin 6 &0 (since p = cos 0). 

577 

Hence for our final randomization model 

f(0) = ~ sin 6 8 (OS6»Stt-/2). 
We are indebted to Dr. William A. Visscher 2 for the following way of developing 



f(6) for Models V and VI. 



' Dr. Visscher of Los Alamos Laboratories attended the Lecture Series at the University of Nevada 
and submitted the derivation above by letter very shortly after the lectures terminated. 
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Take 2 points at random uniformly in an n -dimensional sphere of unit radius. 
Project the straight line connecting them onto a circular projection of the sphere. 
What is the distribution of lengths of chords thus found? 

Definitions. (p u <pi), (p 2 , <p 2 ) are projections onto the circle of the two 
random points in « -dimensional spheres. One can read from Fig. 6.15 that 



(6.49) 



(<P2-<Pl| = <P = 



cos 6 cos 8 

arc cos ± arc cos 



Pi 



Pi 



i.e. for each value of (8, pi, p 2 ) there are either two values of <p (see dashed line) 
or none (if p 2 = min (pi, p 2 )<cos 8). The density function for cos 8 is then 



(6.50) 



/(cos 8) = E{S(cos - C(<p, pi, p 2 ))} 



where C is the solution of (6.49) for cos 8, and E means averaging the 2 value 
points over the n -sphere volume. By construction we have \ /(cos 8) d cos 8 = 
1. The volume integral in n -dimensions is 



(6.51) 



. 1 


r 27r 


pdp 

J 


dip 





dxi 



dx n 



X3+--- + X„<l-p^ 



V n - 2 



,1% 



pdp 



d<p(l-p z ) 



2v(n-l)/2 



where the volume of a sphere in n -dimensions is 

V„ = v n R"; 



1 n 4n 

Vq=1, Vi = Z, V 2 =7T, Vi = —, l'4 = 7T 



8tt 
0s = l5"' ' 6: 



3 t c. 3 

V _ 16-7T 




Fig. 6.15 
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(6.52) 



v n Jo 



2)/2 



p 2 dp 2 (l-p2) 



2s(n-2)j2 



dip S (cos 6 - C(<p, p u p 2 )), 



and the integral over cp can be done, using 

(6.53) 5(cos0-COp,p 1 ,p 2 )) = s(<P 

and 

Sep 



cos cos 
arc cos ± arc cos 



Pi 



P2 



Sep 



S cos 



(6.54) 



«5cos0 



vp<-cos 



after the effect of 2 values of tp for each is accounted for. With (6.54) inserted 
into (6.52), and the p 1; p 2 integral split into p 2 <pi and p2>Pi parts which give 
equal contributions, the integration is straightforward and gives 

/(cos0) = ^|^f 1 P id Pl (l-pV 

V n -'cose 
(6.55) 

8ttvI- 2 (n-l)!2" 



2v(«-2)/2 



f P2 * 2 (l-p|) ( - W2 

■'cos 9 



vp< — cos 



n v„ l-3-5---(2n-l) 
This gives, for n = 2 



sin 



and for « = 3 



/(cos 0) = — sin 3 6» 

277 



32 
/(cos 0) = — sin 5 6». 

077 



We now use the above expressions for /(0) to calculate E(w), Var (w), £(r) 
and Var (r) for each of our randomization models. For example, for Model I we 
have 



E(a l2 )=-\ f 1 6 2 f(6 2 )f(e l ) d6 2 rffli 

77 J Jo 



4 
7T J 

16 

3 
77 

1 

'3' 



rr/2 







o \tti\tt/ 



d$ 2 d6x 



„/2 ^2 



d0. 



16 l/77\ 3 
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Thus the expected number of intersections within the circle between n random 
chords and an additional random chord is 

E(w) = nE(ai 2 ) = -n 

and the expected total number of intersections between the n random chords for 
Model I is 

£(r) = ^«(n-l)E(a 12 ) = gn(n-l). 

To get the second moments of w and r for Model I we calculate 

A j. Tf/2 j- , «, , -, s r 77/2 



32[" /2 \6 2 n tit \] 2 Jn 



~15' 
Hence substituting into our equation 

E(w 2 )= nE(a io ) + n(n - \)E{a io a jo ), i #/, 
we get 

and 

Var(w) = -^ — . 
Our expression for the second moment of r 

E(r 2 ) = ^[2n(n-l)E(a l2 ) + 4n(n-l)(n-2)E(a 12 a l3 ) 



yields 



and 



+n(n-l)(n-2)(n-3)(£(a 12 )) 2 ] 



E(r 2 ) = ~^n(n-l)(5n 2 -n + 12) 



Var(r) = . 
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The calculation of the first two moments of w and r for the other five ran- 
domization models can be carried out in exactly the same manner as for Model I 
above. The results of these calculations are summarized in Table 4. 

The column of primary interest in this table is the column containing the 
values of E{a^), the probability that two particular chords intersect within the 
circle. As was remarked earlier, it is a feature of chromosomes that the chords 
joining homologous pairs intersect about 80% of the time, that is, E(a t i) for the 
chords joining homologous pairs of chromosomes is about .8. No randomization 
model we have looked at gives such a high value. Model VJ is closest with the 
value of .728. We could easily devise a model, for example with f(9) = 
(512/(637r)) sin 10 6, which would give a value of approximately .8 for E («;>), but 
it is doubtful whether we could justify such a model on the basis of the physical 
set-up. The pairs of chromosomes seem to be balanced with respect to the center 
of the circle, thus causing the chords drawn through the pairs to intersect more 
often than if the members of each pair were random with respect to each other 
and the center of the circle. 

More results on random chords. We can derive additional results concerning 
the intersections of random chords for several of the randomization models 
which we considered above. For Model I, for example, where the random chord 
is determined by two points independently and randomly placed on the circum- 
ference, a simpler derivation of £(ai 2 ) was given by W. A. Whitworth in Choice 
and Chance (1897). Not only is Whitworth's method simpler, but it applies to 
random chords in any convex region, and it yields the probability mass function 
of w, the number of intersections between n random chords and an additional 
random chord. For this approach we shall normalize the circumference of our 
convex region to be unity. 

First we shall derive E(a 12 ), the probability that two random chords intersect 
within the convex region. Let the first chord intercept an arc of length x. Then x 
is uniformly distributed on [0,1]. The two chords intersect if one of the end- 
points of the second chord falls on the arc of length x and the other falls on the 
remainder of the circle (of length 1 — x). There are two ways this event can 
happen, namely, the first endpoint can fall on the arc of length x and the second 
outside, or vice versa. Thus we have 

E{ot\ 2 \x) = 2x(l- x) 

and 



f 1 1 

E(a 12 )= 2x(l~x)dx = - 

Jo J 



which agrees with the result for the circle by our previous method. It is 
interesting to note that for this randomization model E(an 2 ) is independent of 
the shape of the convex region. 

E(ai 2 a i3 ) is equally easy to derive by Whitworth's method. Given that chord 
1 intercepts an arc of length x, the event that chord 2 intersects chord 1 is 



Table 4 



Model 


Method of 
determining 
random chord 


/(») 


£<«(/) 


E(w) 


E(r) 




Var (m>) 


Var (r) 


I 


Two random 
points on 
circumferenc 


2 

IT 

i 


1 

3 
(.333) 


1 

— n 
3 


Tfi(n-l) 
o 


2 
15 


n 2 n 

1 — 

45 5 


n(n-l)(n+3) 
45 


II 


Uniform 
distance 
to origin 
and uniform 
angle with 
axes 

(h dpd<p ) 


sin 6 


1 

2 

(.500) 


1 
-n 

2 


-»(#!- 1) 

4 


8 
3tt 




" ( "- i) [( 3 ^4)" 


HI 


Midpoint 
random 
in the 
circle 

y—pdpd<pj 


2 cos fj sin 8 


3 
8 

(.375) 


3 
— n 

8 


^-(-1) 


3 1 
16~3ir 2 


\64 3tt 2 / 

+( — + 51" 

U6 3w 2 / 


n{n ~id-^y 

\128 3ir 2 /J 


IV 


One random 
point in the 
circle and 
a random 
direction 


4 . 2 
— sin 6 

IT 


1 5 

_ + 2 

3 2tt 2 
(.587) 


(\ + ^y 


(i^)" -" 


2 1 49 
15 it 2 4ir 4 


/I 2 6\ 2 
lii"3V + ^J M 

/ 1 3 49 \ 
+ l5 + 2V-4^J" 


n(n-l) 

•[(i"3^ + ^)" 
/ ! 7 121 M 



Table 4 (cont.) 



Two random 
points in 
the circle 






- sin 



1 245 

- + = 

3 72ir 2 

(.678) 



/ 1 245 \ 



245 



\6 144 
•n(n-l) 



2 155 
- + - 



15 I08ir' 
97097 



5184w 



J 5_ 2317 

45 6ir 2 324tt' 

1 425 97097 



>■ 



+ \5 + 2167r i 



5184 



— n 

IT / 



n(n-l) 



[(5- 



5 2317 
~6^ 2 + 324t/ 



V 



'_!_ 965 

,T5 + 432ir 2 



208313 \1 
3 4 2 7 ir 4 JJ 



VI 



Two random 
points in 
a sphere 
projected 
onto the 
plane 



32 
5tt L 



-sin 



1 7007 
3 1800ir 

(.728) 



l\_ 7007 \ 
\3 + 1800ir 2 / 



\6 3600ir 2 / 



n(n~\) 



2 1519 
15 900-rr 2 
73763833 



(s 



49 



45 54 
l 027741 



.,4,4,6 4 
3 5 2 IT 



741\ 2 
3-4)" 

IT / 



/l 441 

+ - + 

\5 200 



77- 



73763833 

-»4 c 4-6 4 

3 5 2 ir 



n(n-l) 



[( 



49 



5 54ir 

102774 1 

"3 3 5 4 2V 4 

1 8869 

1 

*15 



)■ 



<T, 



36007 

29552237\ 



552237\] 
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independent of the event that chord 3 intersects chord 1. Each of these events 
has a probability of 2jc(1 - jc). Thus 

E(a 12 a 13 \x) = 4x 2 (l-xf 



and 

E(a l2 ai 3 ) = 



1 2 

4x 2 {\-x) 2 dx= — 
, 15 



as before. 

With this same approach we can obtain the probability mass function of w, the 
number of chords intersecting one particular chord, which we shall call C. If 
C intercepts an arc of length x (O^xgl), the probability of another chord 
intersecting C is, as before, 2x{\—x). The probability of another chord not 
intersecting C is \-2x{\-x\ which can be written as x 2 + (l-xf. Thus the 
probability that among n random chords exactly w of them intersect C is 

p( W \x)=[^)2-x w (i~xr[x 2 +(i-xfr w 

= ( n )2"x w (i-xyY( n ~ w )x 2k (i-x) 2in - w - k \ 

The probability that, regardless of the length x of the arc intercepted by C, 
exactly w chords intersect C is 

p{w)= (^j2 w "f (" ~ W ) | ' * 2fc+w (l -xf n — 2k dx 

_( n\ n ~ w in- w\{2k + w)l(2n - w -2k)\ 
\wl k=o V k I 



(2n + l)! 

\ W ) Y{ n- W y,2n v (w . 0tl>2i ... iny> 

n + 1 fc=o \ A: // \2k + wJ 



2« + i fc=c 

Another form of this expression given by David and Fix (1964) is 

/ x 1 % w /2k\/ n + 1 \ //2w+2k + l\ , 

P(w)= , , 1M ,- 2l , I ( . )( .fr^J/i ^^ w = 0,l,2,---,n). 

(« + l)2 k=o\k/\w + k + l// \ w + k I 

It is rather surprising that the distribution of w is totally unaffected by the shape 
of the convex region. 

The probability mass function of r is not known at present except for a few 
special values of r when the random chords are defined as in Model I. The values 
of P[r = 0], P[r = 1], P[r = (n(n - l)/2)- 1] and P[r = n(n - 1)/2] for Model I 
were derived by Whitworth. The key to the derivation is illustrated in Fig. 6.16 
below. 

The two regions have different shapes and the endpoints of the chords are 
spaced differently on the perimeters of the regions. On the other hand the 



RANDOM CHORDS IN THE CIRCLE AND THE SPHERE 



155 




Fig. 6.16 



configurations of the chords in the two regions are identical in a certain sense; 
namely, the order of the endpoints around the perimeter is 3, 1, 2, 4, 3, 4, 1, 2 
for both regions. Clearly the number of intersections, r, is completely deter- 
mined by the order of the endpoints around the perimeter and is unaffected by 
the shape of the convex region or the spacing of the endpoints on the perimeter. 
Thus without loss of generality we can fix 2n points on the perimeter of the 
region to be the endpoints of the chords. Under Model I each pair of points is 
equally likely to be connected by a chord. (This is not true for any of the other 
randomization models.) Thus we can compute P[r = /] by finding the number of 
ways our 2n points can be connected forming chords with / intersections and 
dividing this number by the total number of ways the 2« points can be joined to 
form n chords. 

First we shall derive the number of ways that 2n points on the circumference 
of a closed convex curve can be connected to form n chords. From the first given 
point we can draw a chord in 2n — 1 ways, from the next point in In — 3 ways 
(since the first chord used up two points), from the next in 2« — 5 ways, and so 
on. Hence the number of ways is 



1 • 3 - 5- - -(2n-l) = 



1 '2-3 



In (2n)\ 



2-4-6---2/I nl2" 



We shall now derive P[r = 0], Let a n denote the number of ways that 2n 
points on a closed convex curve can be connected to form n chords with no 
intersections within the region. Clearly a.\ = \. Any chord must divide the 
remaining 2« — 2 points so that an even number remain on either side, say 2k on 
the left side and 2n - 2k - 2 on the right side. The 2k points on the left can then 
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be joined to form k nonintersecting chords in ak ways, and the 2n - 2k - 2 
points on the right can be joined to form n — k — 1 nonintersecting chords in 
a n ~k-i ways. Thus if the first chord leaves 2k points on its left side, the 
remaining chords can be drawn in a fc a n - fc -i ways. (If k = or k = n - 1 then all of 
the remaining chords are on one side of the first chord and they can be drawn in 
a„-i ways.) Since k can have any value from to n — 1 we have for n > 1 

The expression on the right hand side is the coefficient of x n ~ x in the power 
series (1 + a\X + a 2 x 2 + • • -) 2 . Therefore 

ai + d2X + aix +■ ■ • = (l + aiX + a2X +•••)• 

Let 5 denote (1 + aix + a 2 x +•••)• Then the above equation can be written 

^- = S 2 or xS 2 -S + l = 0. 



The solution is 



1±vT-4jc 
S ~ 2x 

We shall choose the minus s ign as the plus sign yields a nonsensical answer. The 
series expansion for v 1 -Ax is 

(l-4»)"^l4(4,) + Mzip; jg^)a-2)(4»)' + ,.. 

The coefficient of *" in this expansion is (for n ^ 2) 

(-l)"4'*(M-l)(5-2)(§-3)- ■ • fi-« + l) 

Cn = 



n\ 




(-l)"2-(l-2)(l-4Xl-6)---(l-2(n- 


-1)) 


n\ 




(-1)"(-1)" _1 2"(1 • 3 • 5 ■ • ■ (2n -3)) 




n\ 




2" (2/1-2)! 




«!2-4-6-- -(2n-2) 




2(2«-2)! 





nl(n-l)! 
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Hence 



l-Vl-4x 
S ~ 2x 

l-l+2x + l n=2 — —x 

_ n\(n-iy. 

~ 2x 

(2n)\ 

= 1+ X "" 



(n + l)!n! 
Substituting in the definition of 5 we have 

(2n)\ 



1+ X a„x n = l 



= i 



Equating the coefficients of x" we get 



(2n)! 



(n + l)!n!' 



We get P[r = 0] by dividing a n , the number of ways of drawing n nonintersecting 
chords, by the total number of ways to draw n chords, i.e., 

p[r = Q] ^ (2n)! /C in)l 2" 



(n + l)!n!/ n\2" (n + 1)!' 

In order that there is one and only one intersection among the n chords, we 
must have two chords which intersect and which are not intersected by any of the 
other n — 2 chords none of which intersect each other. The two chords which 
intersect must divide the remaining 2« —4 points into four groups each contain- 
ing an even number. Chords must be drawn connecting the points in each of 
these groups so that there are no intersections. Thus if there are 2k points in a 
group, there are a k ways of connecting the points so there are no intersections 
among the chords of that group. Thus if there are 2k points in the first group, 2/ 
in the second, 2/ in the third and 2m in the fourth (so that 2fc + 2/ + 2; + 2m = 
2n —4), then there are a k aja t a m ways of drawing the remaining n—2 chords so 
they have no intersections with each other or with the two intersecting chords. If 
we draw one of the intersecting chords through a particular point A from among 
the 2n points on the circumference, the number of ways of drawing the remain- 
ing chords is Y.k+i+i+ m = n -2 akdjaia m . The point A can be chosen in 2n ways, but 
each arrangement will then be counted four times. (For example, the arrange- 
ment shown in Fig. 6.17 is counted once for A = 1, A = 2, A=5 and A = 6). 
Thus the total number of ways of drawing the n chords with one intersection 
is (n/2)X fc+>+j+m=n _ 2 a fc a/a,-a m , which is n/2 times the coefficient of x n ~ in 
(1 4- a\x + a 2 x 2 + a 3 x 3 + ■ • -) 4 , i.e. S 4 . In the derivation of P[r = 0] we found that 

5' = ^. 
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We also found that 
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4 5-25 + 1 5-l-2x5 + x 

O — ^ — t 



5 = 1+ I 



(2n)\ 
„tm!(n + l)! 



x . 



Hence 



4 _ 5 - 1 - 2x5 + x 
5 — 5 



= — 1 + U+ I x -1-2x-2 I x + x 

xL \ „=2n!(n + l)! / „=m!(n + l)! J 

= J_r - / (2»+2)! (2n)! \ +1 ] 

x 3 L-i\(n + l)l(n + 2)\ n\(n + 1)1/ J 

= - (/i-lX2n)! „_ 2 
„- 2 n!(n + 2)! * ' 

Thus the coefficient of x"~ 2 in («/2)5 4 is n{n - l)(2n )!/(«!(« + 2)!). This is the 
number of ways the n chords can be drawn with one intersection. Thus 



P[r=l] = 



n(n- 1X2/i)! I (2n)\ _ 2 n n (n - 1) 
nl(n + 2)\ I n\2 n ~ (n+2)! ' 



The values of P[r=j] for 1</' < n(n - 1)/2 - 1 are not known. We can derive 
P[r = n(n -l)/2-l] as follows. In order that r be n(n-l)/2-l (i.e., one less 




Fig. 6.17 



RANDOM CHORDS IN THE CIRCLE AND THE SPHERE 159 

8 9 




Fig. 6.18 

than the maximum possible number of intersections), we must have two chords 
which do not intersect but the other n — 2 chords must all intersect each other as 
well as the two nonintersecting chords. This can be done only if the two 
nonintersecting chords are adjacent (see Fig. 6.18) and there are « — 2 points on 
each side of the nonintersecting chords. Each of these remaining points must be 
connected with the point opposite it on the boundary. (If the points are 
numbered consecutively then point 1 must be joined to point n + 1, point 2 to 
point n + 2 and so on, except for the endpoints of the nonintersecting chords.) 
Thus once the two nonintersecting chords have been chosen there is only one 
way to connect the remaining In -4 points so that the n -2 chords all intersect 
the two nonintersecting chords and each other. There are n different ways to 
choose the two nonintersecting chords. Thus there are n ways of joining the 2n 
points so that the chords have n(n-l)/2-l intersections within the region. 
Hence P[r = n(n — 1)/2 — 1] is n divided by the total number of ways the 2n 
points can be joined, that is 

r n(n-l) I nn\2" 

p [ r= -~2 — l r^y.- 

For each chord to intersect every other chord, each of the 2n points must be 
joined to the point opposite it. That is, if the points are numbered sequentially 
around the boundary, point 1 must be joined to point n + \, point 2 to point 
n+2, and so on. There is, of course, just one way (out of the (2n)!/(n!2 n ) 
possible ways of joining the 2n points) that this can be achieved. Thus the 
probability that the total number of intersections within the convex region is 
n(n — 1)/2 is 

n(n-l)l n\2 n 



i-^Y 



(2«)! 
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The above methods for deriving P[r = 0], P[r = 1], P[r = (n(n - 1)/2)- 1] and 
P[r = n{n- 1)/2] are applicable only to Model I and do not extend to the other 
models. On the other hand, some of the above results, namely E(ai2) for an 
arbitrary convex region and the mass function of w (for the circular case), can be 
obtained for Model II also. First we must restate the Model II randomization 
model so that it applies to any convex region C. Let O be any point in C. If the 
perpendicular from O to the extension of the random chord has length p and 
makes an angle $ with some axis, we want the joint density of p and <t> to be 
proportional to dpd<j> as in the circular case. (See Fig. 6.19.) In our previous 
discussion of random lines of a Poisson field which intersect a convex region, we 
found that the density of such lines is L~ dp d<f> where L is the perimeter of the 
region. Thus we see that random chords in a convex region C as defined by 
Model II must have the same distribution as random chords formed by random 
lines of a Poisson field intersecting C. We have already obtained several results 
for random chords formed by random lines intersecting a convex region. The 
probability that two chords intersect within C is 

E(<xi2) = -jj- 

where A is the area and L the perimeter of C. For the case where C is a circle we 
get £'(ai2) = 2, which agrees with our previous result. 



random 
chord 




FIG. 6.19 

Another result from the section on random lines intersecting a convex region 
is the fact that the probability that a random chord intersects a line segment of 
length / in C is 21/ L. Thus if a random chord in a circle of unit radius has length 
/, the probability that it is hit by another random chord is 1/tt. We shall use this 
fact to derive the mass function of w for Model II in the circular case. The 
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method is quite similar to that used above in the derivation of the mass function 
of w for Model I. If p is the distance from the rando m cho rd to the center of the 
circle, p is distributed uniformly on [0, 1] and / = 2Vl-p 2 . Thus we have 



P[Intersection \p] = 
and 



2VT^7 



2 

F[No Intersection | p] = 1 Vl-p 2 . 

The probability that n random chords intersect a chord of length / exactly w 
times is 

Thus the unconditional probability mass function of w is 

\wl d=o \ k >\vi Jo 

Letting x -p 1 , we get dp = \x in dx and 

^-K:)X«-"-"Tn(!n , '-" j « 1 -^"-" 

_i/«\ n v w / 1 .„_ w ^ fc /«-K'\/2\"- fc r(i/2)r((/t-iit4-2)/2) 



;c^>-("D© 



2\w/ fc t v ' V k l\v) T((n-k + 3)/2) 

Hence we have shown that the probability that a chord chosen from among n + 1 
random (Model II) chords of a circle is crossed exactly w times by the remaining 
n chords is 

n - w - k {n- W \{2y-*Y{{n-k+2)/2) 



^^0 ;?:<-»-'( v)© 



r((n-k + 3)/2Y 



Random chords of a sphere. In this section, we will consider the distribution of 
the length of a random chord of a sphere, under eight different randomization 
models. The developments are due to Berengut (1972). This treatment will be 
somewhat analogous to the discussion of random chords of a circle, under 
various models, considered earlier in connection with the paper of David and Fix 
(1964). We will assume in all cases that the sphere has unit radius, and we will let 
L be the random variable denoting chord length, with density /(/). 
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Model 1. Chord joining two points uniformly distributed on surface of sphere. 
We can let the position Pi of the first point be arbitrary. Then the Pr {/ < L < I + 
dl} is 1/(4it) times the area of the circular band in Fig. 6.20. 




length / 



o 
t 




(b) Cross-section through P x . 
Fig. 6.20 

Now the width of the band is ds = sec <p dl. But q> is the angle between P X B 
and the tangent to the circle at B. Hence by elementary geometry, <p is equal to 
the angle subtended on the upper part of the circle by the chord P\B. But this 
angle is just (tt/2)—6. Hence <p = (ir/2) — d, and therefore ds = esc 6 dl. Now the 
area of the band is ds times its circumference, which is 2tt ■ I sin 0. Therefore, 



(6.56a) 
(6.56b) 



Area of band = cos 6 dl ■ 2trl sin 6 = 2irl dl ; 

Pr{KL<l + dl} = -?--2irldl=l-dl; 
4tt 2 
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and 
(6.56c) 

(6.56d) 

(6.56e) 



f(l)=l/2, = 



.3,2 



«o-a*-[g.4 



Var (J) 



16 
9 



Model 2. Chord in random direction from point uniformly distributed on 
surface. Let the position of the point be P. Let 8 be the angle between the 
random chord through P and the diameter through P. (See Fig. 6.21.) Since the 
chord PQ is randomly directed, we know from a previous result that the density 
of 8 is proportional to sin 8, i.e., 8 has density g(8) = sin 8, 0^dSw/2. Now 
triangle PQR is a right triangle with hypotenuse of length 2; hence / = 2 cos 8. 
Thus \dl/d8\ = 2 sin 8. The density of / is thus f(l) = g(d)/\dl/d0\ = 
sin 8/(2 sin^) =1/2; i.e., 



(6.57) 



/(/) = |, 0^/§2, 



£(0=1, Var(/) = j. 




Model 3. Chord in random direction, distance from center uniformly dis- 
tributed. Let r be the distance of the chord from the center of the circle. Then 



/ = 2VT 



2(-2r) 



271^7 



2r 



VT 
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Since r is uniformly distributed on (0, 1), 



/(/)= * = ^Eg= m = _i_ 0as/ss2 . 

m 2r/ST? 2r 2Vw 2 /4 2s^?' 
(6.58) 



1 r r 



Let w = (4-/ 2 )/4. Then / = 2Vl-«, du = -(1/2) dl; therefore, 

2 Jo 2^ 2^1^ 
= Cu- 1/2 (l-u) l/2 du 

•"0 



/3\T^/1 



r(l)r® i r r 7T 

2 r(i>r(2) _ 2^ 8 

r(f ) f • |vTT 3 



E(l 2 ) = 2 f « _1/2 (1- M )d«= : 
Jo 



therefore, 



*r(0-f-£ 



Model 4. Center of chord uniformly distributed inside sphere; random 
direction. Let r be the distance of the chord (i.e., the center of the chord) from 
the center of the sphere. 



Pr(r SrSro + rfr) = Pr(rSro + dr)-Pr(rgro) 

1(1^ + *?) 1(1^) 



= {r + drf-rl 

= 3rodr + o(dr). 

Hence r has density 

(6.59) g(r) = 3r 2 , OSrSl. 

Now / = 2Vl-r 2 , r = V(4-/ 2 )/2, |d//dr| = 2/-/Vl-r 2 . Hence the density of / is 
given by 

/(0=3r 2 



Vl-r 5 


=±Wi- 


-r 2 = 


3 V4-/ 2 


/ 


=±/V 4 - 


-/ J , 


2r 


2 






2 2 


2 


8 





(6.60) 

£(0=1 /V4~w 2 <//, 0^/2i2. 

8 Jo 
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As before, letting u = (4- / 2 )/4, we get 



£(/) = 



8 J 



4(1 



u)-2u 112 - 2 l/2 du=3\ « 1/2 (1-«) 1/2 
z.(i~u) Jo 



du 



.3rd)r^) = 3-(7r/4) = 37T 
f(3) 2 8 ' 



Similarly, 



E(l 2 ) = 6 



1 wn w 6r(|)T(2) 6-^ 8 
u (1 — u)au ~ 



Hi) 



5 3 1 /— «' 



v„(0-f-^. 



Model 5. Chord in random direction through point uniformly distributed 
inside sphere. Let t be the distance of the random point P from the center of the 
sphere. Let 8 be the acute angle between the chord and the radius through P. 
Then the distance from the center to the chord is r = t sin 0. Figure 6.22, which 
gives the cross-section through the sphere containing the chord and the center of 
the sphere, illustrates this. Now, since the chord through P has random direc- 
tion, we know that has density g(0) = sin 6, 0^d^ir/2. Let t be given. Then 
dr = t cosd dS. Thus the conditional density of r, given t is 



(6.61) 



h(r\t) = sm 6 



1 



tan 



t cos 6 



tJ? 



0: 




Fig. 6.22 



Now since P is uniformly distributed inside the sphere, t has density g(t) - 3t , 
g t ^ 1, as was demonstrated previously in Model 4. Hence the density of r is 



h(r)=\ h(, 



r\t)g(t)dt = 



r tSF^? 



3t 2 dt 



-fe 



:dt 



3r [ du 3r 1/2l i- rJ , n 1 



"3 
■r , 
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that is, 




(6.62) 


h(r) = 3rJl-r 2 , OSrSl 


Now / = 


= 2Vl -r 2 , \dl/dr\ = 2rf-Jl -r 2 . Therefore, 




ao-Wi-^. 2r -fa-^,-|/' : 


(6.63a) 


3 f 2 3 3 




E ^-\\/ d "k*-f- 


therefore 




(6.63b) 


v m 12 9 3 

V3r(/)= 5-4 = 20- 



OS/^2, 



Model 6. Chord through two points independently uniformly distributed 
inside sphere. Let the position of one of the points Pi be at a dista nce t fr om the 
center. We will first condition on t. Consider Pr(Ls/|?), for 2Vl-/ 2 g/^2. 
This is just \/(4it) times the volume inside a pair of cones inside the sphere, with 
axis the diameter through Pi. Figure 6.23 illustrates the region. 





height 
of cap 



(a) 



(b) Cross -section containing A, B, C, D. 



Fig. 6.23 



We have |PiC| = |PiA| = 6, \PiB\ = |PiD| = a, a + b = l. Also, \P x E\ = y, 
\OF\ = x. (If F lies on the same side of O as Pi, then we take x to be negative.) It 
is clear that L ^ / if and only if the second point P 2 lies inside the regions 
bounded by the sides of the cones. Now the region above Pi (having A and C as 
boundary points) consists of a cone with flat base of height y, plus a spherical cap 
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of height 1-t-y. Similarly, the region below P l (having D and B as boundary 
points) consists of a cone of height x + 1 and a spherical cap of height 1 - x. We 
want to obtain the volume of these regions in terms of t and /. 

Consider a spherical cap of height h, Ogli §1. Its base is a circle of radius 
Vl-(l-fc) 2 . (See Fig. 6.24.) 




Fig. 6.24 
Now, clearly, the volume of the cap is given by 



V(h) = 



.s/l-(l~h)2 



y\-r-{\-h)}rdrdd 



M 



3/2 



2 N 3/2,r=0 



] ; -2tt(1- 



h) 



\~{\-h) 1 



= 7r[~-(l-h) + ~(l-hf], OS/,511. 



Now for lsft<2, V(h) = (4/3)TT-V(2-h)^Tr((2/3)-(l-h)+(l/3)(l-h) 3 ). 
Thus 

V(h) = ir(l-(l-h) + ^(l-h) 3 y OS/^2. 



(6.64) 

Hence the volume of the upper cap is 

/2 



(6.65) 



v,-^-(, + o + tt±ii-) 



Now, for the lower cap, since we take x to be negative for h > 1, we have 
1 - h = x in all cases. Hence the volume of the lower cap is 

.3. 



(6.66) 



V 2 = -rr{--x + 



t)- 
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Now the upper cone has base radius Jb 2 -y 2 and height y ; hence its volume is 

(6.67) V 3 = jTr(b 2 -y 2 )y. 
Similarly, the lower cone has volume 

(6.68) V 4 = ^ir(l-x 2 )(t + x). 
Adding (6.65)-(6.68), we get 

(6.69) V=V 1 +V 2 +V 3 +V 4 = ir[l+(j+t 2 -l)y + ty 2 -~x~x 2 -j+ t -}. 

We want to express V in terms of t and /. To do this, we have to find expressions 
for a, b, x, and y in terms of t and /. 

Now, from elementary geometry, we know that for two intersecting chords of 
a circle, the product of the segments of one is equal to the product of the 
segments of the other. Applying this to chords AB and GH, we get ab = 
(1 - 0(1 + 0- Now using the relation a + b = /, we can solve a quadratic equation 
to obtain the relations. 



(6 . 70) ._jwp^E3 

(S . 7i) 4 _t^EffiE3. 

We also have the relations b 2 ~y 2 = l-(y + tf and l-x 2 -a 2 -(t + xf, from 
which we obtain the following expression for x and y : 

(6.72) 



„2 .1 ! 

a — t — 1 



It 
(6.73) y = 1 " 



\-t 2 -b 2 



It 
Using (6.70) and (6.71), we obtain finally 



(6.74, x _ e+*+WF=iy=?) t 

At 

Substituting these expressions into (6.69), after considerable simplification we 
obtain 



(4 /V/ 2 -4(l-f 2 )\ o/ — 5 



(676) v=7r \3 12, )■ 2 ^ 



ri/§2. 
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Now since Pr (L > I \ t) = (4tt/3)~ 1 V, for 2^1 -t 2 S / § 2, we have: 



(6.77) Pr(L>/|0= \ 16* ' 

(l, 0S/S2V1-? 2 . 

Now f, being the distance of Pi from the center O, has density /(f) = 3t 2 , as we 
saw before in (6.59). Hence 



Pr(L>/)= ( ?r(L>l\t)3t 2 dt 
Jo 



(\i J C /W-4(l-f 2 L 2 _, 
= 3t 2 dt-\ — *- — -3f 2 <# 

Jo M4-UV5 16f 



(6.78) =i_lL.f' ,V/ 2 -4(W 2 )d? 

-l~-j-|[{/ 2 -4(l-t 2 ))" 2 ]i:in=mn 

64 
Hence Pr (L§/)= / 6 /64, S / g 2, and the density of L is 

(6.79a) /(,) = ^ /5 ' ° = ' = 2 > 

(6.79b) £(/) = Aj o 2 ^ / = A(|) = i2 ) 

(6.79c) Ei n-±£r<U-±£)-3; 

therefore 

(6.79d) Var(0 = 3-^ = ^. 

Model 7. Chord normal to random plane intersecting sphere, through point 
uniformly distributed inside circle of intersection. By random planes we mean 
planes with measure invariant under rotations, translations and reflections. We 
are interested in the conditional space of random planes which intersect the 
sphere. Let p be the distance of the random intersecting plane from the center of 
the sphere. The random chord is obtained by choosing a point P having uniform 
distribution in the circle of intersection of the plane with the sphere, and drawing 
the normal to the plane through that point. Let r be the distan ce of the point P 
from the center Q of the circle of intersection, OSrSVl-p . (See Fig. 6.25.) 
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Let p be given. Then 



?T(r ^r^r +dr) = — ^[^0+ drf - irr 2 ] 

7r(l-p ) 



2r 
1-P 2 



dr + o(dr), 0S/oS>A-p 



I 



Hence the conditional density of r given p is 

(6.80) g(Hp) = T^ L 5, Og/^l-p 2 . 

1-P 

Now since OQ and the chord are both normal to the same plane, the di stance 
from the cho rd to t he center O is r. Hence the length of the chord is / = 2Vl-r 2 , 
\dl/dr\ = 2r/\l — r. Making the transformation to /, we obtain the conditional 
density of /: 



2 r Vl-r 2 Vl-r 2 I 

(6.81) f(i\p) = - j.— — = - r = — j-, 2pg/g2. 

1-p 2r 1-p 2(1 —p ) 

Now from the theory of random planes, we know (cf. Kendall and Moran (1963, 
pp. 20-22)) that p has the uniform distribution. Hence the density of / is 

f' /2 f' /2 / /f' /2 / 1 1 \ 

AD- J, /Wlrt*-J[ ^7)*-2i (w^ + WT) dp 

-j[ii.a+p)-iog(i-(.)];:i" 

(6.82) £(/) = 7 f / 2 ln4±^/ = i|[ / 2 In (2 + /)<// - f / 2 ln(2-/)rf/ 

4 Jo 2 — / 4 U Jo 

•j. -In 4 -In 2 .. 

= 7 u(e 3u -4e 2u +4e u )du- u(e 3u -4e 2u +4e u )du\ 

4 Ujn 2 J-oo J 

=»-f)-4v^>-<:; 



2 



In a similar manner, we obtain E(l 2 ) = 8/3. Hence, Var(/) = 
20/9 - (16/9) In 2 - (16/9)(ln 2) 2 . 

Model 8. Chord normal to plane of random great circle, through point uni- 
formly distributed in circle of intersection. This randomization model cor- 
responds to the measure obtained by taking the restriction of "random" lines in 



RANDOM CHORDS IN THE CIRCLE AND THE SPHERE 



171 




random plane 



Fig. 6.25 

3 dimensions to those intersecting the sphere, where "random" lines in 3 
dimensions refers to the measure on lines which is invariant under translations, 
rotations and reflections. 

Let r be the distance of the chord from the center. By construction, P is 
uniformly distributed inside the great circle. (See Fig. 6.26.) Hence, applying 
(6.80) with p = 0, we find that the density of r is 

(6.83) g(r) = 2r, O^r^l. 

Making the transformation / = 2Vl-r 2 , we obtain the density of /: 

/ 



(6.84) 



/(0 = 



2' 



Og/^2. 



We observe that this is the same density as that obtained under Model 1, 
where the chord is formed by joining two points independently uniformly 
distributed on the surface of the sphere. Since the measures induced by both 
models are directionally homogeneous by construction, it is clear that the 
measures are identical. 




random 
great circle 



chord 



Fig. 6.26 
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We also observe that E(l), which was shown to be 4/3 in Model 1, indeed is 
equal to 4V/S. Table 5 summarizes our results. 



Table 5 





HI) 






Model 


0S/S2 


EU) 


Var (0 


1. Joining two random points on 
surface 


2 


4 

|= 1.333 


1= 333 


2. Random direction from random 


1 

2 


1 = 1.000 


±=.333 


point on surface 




3. Random direction, uniformly 
distributed distance from center 


/ 


f=1.571 


5-^- 199 
3 4 


2V4-/ 2 




■*» 


^=1.178 

O 


5-^=212 
5 64 


4. Chord center uniformly dis- 


!'•"-'' 


tributed inside sphere; random 


direction 








5. Random direction through ran- 


l>' 


f=1.500 


55-- 150 


dom point inside sphere 


6. Through 2 random points inside 
sphere 


i>' 


^=1.714 


h- Ml 


7. Normal to random intersecting 
plane, through random point in 


/, 2 + / 

4 ln 2^7 


4 2 

^ln 2+|= 1.591 


f-y0n2 + )n 2 2) = 

.134 


circle of intersection 






8. Normal to plane of random great 


I 

2 


same as 


Model 1 


circle, through random point in cir- 






cle of intersection 
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